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Introduction 


The class of polynomials is an extremely important class of functions, both in theoretical and in applied 
mathematics. The definition of a polynomial is so simple that one may believe that everything is trivial 
for polynomials. Of course, this is far from the truth. For instance, how can one numerically find the 
solution of the equation 


(1) 3287-22 +1=0 


within a given (small) error ¢? An application of Rouché’s theorem (cf. Section 3.5) shows that all 87 
roots lie in the narrow annulus 0.96 < |z| < 1, so it will be very crowded in this annulus concerning 
the roots. In principle we set up some guidelines in this book so the roots can be found. The task, 
however, is far beyond the scope of the present volume, so it is left to the few interested readers to use 
the methods given in the following and a computer in order to find the 86 complex roots remaining 
after we have found the only real root in Section 4.1. 


In the first chapter we describe some results on polynomials in general, before we in the next three 
chapters proceed with the main subject of this book, namely to find the zeros of a polynomial. The 
topics are (mainly following the contents of the chapters, but not strictly) 


1) Explicit solution formulee 


e The fundamental theorem of algebra 
e The binomial equation 

e The equation of second degree 

e Rational roots 

e Multiple roots 


e The Euclidean algorithm, i.e. common roots of two polynomials 
2) Position of roots of polynomials in a complex plane (classical results) 


Descartes’s theorem 


e Fourier-Budan’s theorem 
e Sturm’s theorem 
e Rouché’s theorem 


e Hurwitz polynomials 
3) Approximation methods 


e Newton’s iteration method 


e Graeffe’s root-squaring method. 
We shall occasionally in a few topics assume some knowledge of Complex Functions Theory. 


All topics of this book have been known in the literature for more than a century. Nevertheless, it is 
the impression of the author that they are no longer common knowledge. One example is Graeffe’s 
root-squaring method to find numerically roots why lie very close to each other in absolute size. It can 
in principle be used to find the 86 complex roots of (1), but the work will be so large that it cannot 
be included here. 
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Rouché’s theorem and Hurwitz’s criterion of stability and their applications are well-known in Stability 
Theory and among mathematicians, but in general, engineers do not know them. This is a pity, because 
the can often be used to limit the domain, in which the roots of a polynomial are situation. It is, e.g., 
by two very simple applications of Rouché’s theorem that we can conclude that all the roots of (1) lie 
in the open annulus 0.96 < |z| < 1. 


Another extremely important theorem, which to the author’s experience is not commonly known 
by engineers, is Weierstrap’s approximation theorem. It states that every continuous function f(t) 
defined in a closed bounded interval I can be uniformly approximated by a sequence of polynomials. 


More explicitly, for every given ¢ > 0 and every given continuous function f(t) on I one can explicitly 
find a polynomial P(t), such that 


(2) |f()-—P(|<e — foreveryte I. 
This means in practice that if the tolerated uncertainty is a given ¢ > 0, then we are allowed to 


replace the continuous function f(t) by the polynomial P(t), given by Weierstrap’s approximation 
theorem. This is very fortunate for the use of computers, which strictly speaking are limited to only 
work with polynomials, because only a finite number of constants can be stored in a computer. As 
indicated above there even exists an explicit construction (Bernstein polynomials) of such polynomials 
P(t), when f(t) and € > 0 are given, such that (2) in fulfilled. One can prove that these Bernstein 
polynomials are not the optimum choice, but in general they are “very close” to be it. 


Since we want to emphasize this very important theorem of Weierstraf, although it is not needed in 
the text itself, it has been described in a section of the Appendix. 


Errors are unavoidable, so the author just hopes that there will not be too many of them. 
October 5, 2011 


Leif Mejlbro 
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1 Complex polynomials in general 


1.1 Polynomials in one variable. 

A complex function of the form 

(3) P(z) = ap2" + ay2™—* +--+ + apie + Ga, ao,---,4n € C, a9 £0, constants, 
in the complex variable z € C is called a polynomial of degree n. 


When the polynomial is restricted to the real axis, we shall often write P(x), x € R, instead of P(z), 
though we may in the later chapters also from time to time use the notation P(x) for « € C complex. 
Sometimes we shall allow ourselves to omit “x € R” or “x € C”, etc., where it is obvious, whether x 
is real or complex. 


The following two results are well-known. 
Proposition 1.1.1 A polynomial P(z) is continuous everywhere in C. 


PRooF. It suffices to prove that every monomial z” is continuous at every fixed z € C. It follows 
from the binomial formula, cf. Appendix 5.1, that 


Deloitte. 


Discover the truth at www.deloitte.ca/careers © Deloitte & Touche LLP and affiliated entities. 
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n n—-1 

n n __ n n-j i n n—1-—4 : 

(4) (zo + Az) “B= (5) a favaaey’ (1) 8 Te, 
j= j= 


so (29 + Az)” — 2 — 0 for Az — 0, ie. (zo + Az)” — 2? for Az — 0, and the proposition follows. 


Proposition 1.1.2 A polynomial P(z) of degree n is continuously differentiable everywhere in C, and 
its derivative P'(z) is a polynomial of degree n — 1. 


ProoF. It suffices again just to consider a monomial z”. Then by (4), 


- noon n—1 ; n—2 , ; 
(20+ Az)" = 20 2) BY ig S- es ) zy 9 Az = nay +z) 0 ee ) an? J Add, 
Zz é 
jJ= 


= j=0 
hence, 
n 
_ (zo + Az)" — 26 al 
lim <2 _ = NZ 
Az—0 Az 


Using this result and the linearity it follows that the derivative of P(z) given by (3) is 


P'(z) = nag 2") + (nm — Day 2? +--+ + api, 


which is a polynomial of degree n — 1, and the proposition is proved. 
It is very important that the description (3) of a polynomial P(z) is unique. This follows from 
Theorem 1.1.1 The identity theorem. Two complex polynomials P(z) and Q(z) which are equal to 
each other for every z € C, have the same degree n and the same coefficients ag, a), ..., Gn € C as 
given in (8). 
Proor. Assume that 

P(z) = 092" +a)2" 1 + +++ + ay and Q(z) = bz” + byz7-* +++ + by. 


If necessary, we have here supplied with zero terms, so that the n+ 1 coefficients become ag,..., dn 
and b,...,bn, even if e.g. bp = 0, etc.. We shall prove that if P(z) = Q(<) for all z € C, then a; = b; 
for every 7 = 0,1,..., 7. 


If we choose z = 0, then we get ay, = P(0) = Q(0) = bn, thus ay, = bp, and it follows by a reduction 
that 


Ope + aye eh ap ae pe HO hs for all z EC. 
When z # 0, this equation is equivalent to 
(5) agz™—* + age? +++ + ap = be | + bz 2 + + bya, for z € C \ {0}. 


However, due to the continuity, (5) also holds for z = 0. 


Repeating this process we get successively ajn—1 = bn-1, ---, @o = bo, and the theorem is proved. 
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Remark 1.1.1 A similar argument shows that if two convergent power series (same point of expan- 
sion) are equal in their common domain of convergence, then they have the same coefficients. See also 
Appendix 5.2. © 


It is convenient to define the zero polynomial as the function Q(z) = 0 and use the polynomial 
description 


Q(z) =0-27+0-2"-1+--.40-240, 


whenever necessary, although this is not in agreement with the definition (3). We shall also say that 
the zero polynomial has the degree —oo. We obtain by this convention that the degree of a product 
of polynomials is equal to the sum of the degrees of the polynomials, i.e. 


deg(P(z) - Q(z)) = deg P(z) + deg Q(z), 


even if one of them is the zero polynomial. Here, deg P(z) denotes the degree of the polynomial P(z). 


1.2 Transformations of real polynomials. 
If all coefficients ag,...,@y of (3) are real, we say that P(z) is a real polynomial 
P(z) = agz” AQge Listed Ge a ap, ao,.--,4, € Rand zeC. 


This is of course an abuse of the language, because only the coefficients are real, and P(z) is not a 
real number for general z € C. However, if z = « € R, then P(z) is always real. 


We shall in the following show some simple transformation rules of real polynomials in a real variable 
x €R. These rules also hold for a complex variable z € C, but for clarity we shall only consider P(x), 
x €R, in the discussions below. 


1.2.1 Translations. 


The chance of variable is here given by = y +k, where k € R is some real constant. If P(«) has 
degree n, then it follows by a Taylor expansion from k, cf. Appendix 5.4.1, that 


POR) 2 POV) P\(k) 
"tr — 1)! e 


The most commonly used translation is given by 


m—ly 4 
T T 


Qify) := Pytk)= 


n! 


fein 


? 
Tao 


by which the coefficient by of y”~! of Qi(y) becomes 0. 


There is a reason why one usually only uses the translation above. In principle, we can set up a set of 


equations, such that any given b;, 7 = 2,...,n, in Qi(y) becomes zero. Unfortunately, the equations 
in the unknown translation parameter & will in general be increasingly difficult to solve, i.e. 
PO-jI(k 
per ey for 9 = 2y.i6 4: 
(n— j)! 


For 7 = n we see that we shall solve P(k) = 0, ie. find a zero of the polynomial, and we know that 
this is in general not possible to find in all cases by an exact solution formula. 
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Example 1.2.1 A polynomial can always be normalized by dividing it by ag 4 0. We may therefore 
assume that ao = 1, so let us consider the polynomial of third degree, 


P(x) = 2° +. ax? + apr + az, a1, a2, a3 ER, reER. 
Let k € R. Then by the translation « = y + k, 
P(t) = 2? +a,2? + ag¢+a3 = (y+k)? + a:(y+ k)? + a2(y+k) 4+ a3 
(6) = y+ {3k+a}y? + {3k? + 2ka, + ag} y + {k® + ak? + agk +3} 


= y by? + boy + bs. 
The identity theorem gives 


b} = 3k+ a, bo = 3k? + 2ka, + ao, bs = k? + ak? + agk +43. 
ay 

a 

2) Choosing bz = 0 we get 3k? + 2ka; + a2 = 0, hence 


—2a, + ,/4a? — 12 1 
k= a a Ae i { a+ ya} — Baa} 


6 3 


1) Choosing 6; = 0 we get k = — 


3) Choosing b3 = 0 it follows that this is the same as finding the zeros of the polynomial. 
% 


1.2.2 Similarities. 


For given k € R \ {0}, the similarity of factor k is defined as the change of variable 


y 
y x, us t= 7 


In this case the transformed polynomial becomes equivalent to 
Qa(y) = k” P(2) =agy” tarky” | + ak? y"? +++ +an-1k" ty + ank”, 
where we for convenience have multiplied by k”. 


Clearly, if xo is a root of P(x), i.e. P(xo) = 0, then yo = k xo is a root of Qea(y), so similarities may 
be used to scaling. 


If all coefficients ao,...,@n € Q of P(x) are rational numbers, then we can choose k € N so large that 
the equivalent polynomial 
~ 1 ak agk? nk” 
Qa(y) = — Qa(y) =y® + —— yt $y Ft 
ag ag ao ag 
is normalized, bp = 1, and all other coefficients b1,...,b, € Z are integers. 


A polynomial is called normalized, if its coefficient bj) = 1 of the term of highest degree. We see 
that every polynomial of rational coefficients can be transformed into a normalized polynomial by a 
similarity after a division by ag # 0. 
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1.2.3 Reflection in 0. 
The reflection is given by the change of variable y = —x, x = —y, so 
Qs(y) = P(—y) = (-1)"aoy” + (—1)" tary"? +++ + @n-2y? — Gn-1y + Gn. 


This can of course also be considered as a similarity where k = —1. All coefficients of odd index 
change sign, while all coefficients of even index are unchanged. If zo is a (real) root of P(x), then 
Yo = —Xo is a (real) root of Q3(y), so all real roots change their sign by a reflection. 


We shall later give some criteria concerning real positive roots. Reflection can be used to obtain 
similar results for real negative roots. 


1.2.4 Inversion. 


1 1 
The inversion is given by the change of variable = —, y = —, where we must require that « 4 0 and 
x 


¥y 
y #0. When we multiply by y” 4 0, we obtain the equivalent polynomial 
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1 
Qaly) = y” P(2) = Gny" + On-1y” 1 +++: tary tao, 


so the coefficients are here given in the reversed order. 


If 29 £0 is a root of P(x), then yo = 1/29 is a root of Qa(y), and if yo £ 0 is a root of Qa(y), then 
Lo = 1/yo is a root of P(x). 


1.3. The fundamental theorem of algebra. 


It is very difficult, if possible at all, to make a serious investigation of the polynomials without being 
able to refer to the Fundamental theorem of Algebra. We shall therefore in this section prove this 
important theorem, before we start on other deeper results. 


We have already used the terminology that zo € C is a root or a zero (both names are used in the 


following) of a polynomial P(z), if P (zo) =0. 


Theorem 1.3.1 The fundamental theorem of algebra. Every polynomial P(z) of degree > 1 has at 
least one root z=) € C. 


The following proof is often called Cauchy’s proof in spite of the fact that it is actually due to Argand, 
1815. The first attempt of a proof goes back to d’Alembert in 1746, and the theorem is therefore also 
called d’Alembert’s theorem. 


PROOF. Consider the polynomial 
P(z) = agz” +ay2™ * + ++++ @n_12+ Gn, n>1land ao £0. 


Clearly, P(0) = ao, so if a, = 0, then z = 0 is a root. We may therefore in the following assume that 
also Gyn, # 0. 


Let |z| =r > 0. Then we have the estimate 


|P(z)| = lao r® — Jaa] r"~* — Jag] r®? — +++ = Jana] r= Jan 


ay ag Gn-1 a 
(7) = r* {ao (is ace a 


When r — +00, the right hand side of (7) tends towards +oo. In particular, P(z) £ 0, if |z] =r>A 
is sufficiently large. We choose A, such that |P(z)| > |an|, if jz] =r > A. 


The real function |P(z)| is continuous on the closed bounded dise {z € C | |z| < A}, so by one of the 
main theorems of continuous functions, |P(z)| must have a minimum in this disc, so there exists a 
zo € C, where |zo| < A, such that 


|P (zo)| < |P(z)| for every z € C, for which |z| < A. 
It follows in particular that 


|P (z0)| < |P(0)| = lan], 
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and since |P(z)| > |a,| for z] = A, we conclude that |zo| < A, so 29 lies in the interior of this disc. 


We shall prove that P (zo) = 0. This is done contrariwise, i.e. we assume instead that P (zg) 4 0, and 
then we derive a contradiction. 


P@)| > |a,| 


Figure 1: The discs in the proof of Theorem 1.3.1. 


Choose go, such that 0 < @9 < A— |zol, cf. Figure 1. Then 
Bi={z€C||z— 20] < oo} C {2 EC | |2| < A}, 


so B is an open subset of the closed disc defined by |z| < A. We put z = zo +h for z € B, so |h| < @p. 
Then 


P (zo +h) = boh™ + bh? +--+ + bat bp, by = P (zo) £0 and bp = ap £0. 


Choose 7 € {0,1,...,2 — 1}, such that b; A 0 and b, = 0 fork = 7 +1,...,n—1. Then, since 
P (20) = bn, 


P(zo +h) =P(z0)+boh"+---+bj;h™%, — where bj £0. 
We write h in polar coordinates, 
h=oe®, where 0 < 0 < 09 < A— [Zo], 
thus |z + h| < A. 
Also, write P (zo) and 6; in polar coordinates, 
P (zq) = |P (z0)| e” and b; = |b;| e”. 
Then 
P (zo +h) =|P(2)| e'? + [>;| oF etl +(n-N®) 4 bj-1 i Mamas aes ree 8 se nd 


Choose 9, i.e. the angle of h, such that w + (n— 7)0 = y+4+7. Then 


P (Zz mc h) => {|P (z)| = |b; pe} el? + bj-1 pr-stt feeet bo h”. 
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Then choose g so small that also |b;| 9” < |P (zo)|, so we get the estimates 


IP(zo+h)| < |P(z0)|—|bj| 0" 2 + |bj-1| 0” 247 +--+ + [bo] 0” 


(8) = |P (z0)|— "7 {[b;| — |bj-1l e- +++ — [bol @} - 
For some smaller @ > 0 we can obtain that also 

|bj| — 1b;-1] @— +++ — |bo| @” > 0, 
hence, for such a @ we have 

|P (20 + h)| < |P (20)|, 


which is contradicting the assumption that |P(zo)| was a minimum. Hence, the other assumption 
that P (zo) 4 0 must be wrong, and we finally conclude that P (zo) = 0. 


Corollary 1.3.1 The fundamental theorem of algebra. Every polynomial P(z) of degree n > 1 is, 
apart from the order of the factors, uniquely factorized in the following way, 


(9) P(z) = a(z— a1) (z—a2)-++(z— an), a #0. 

In particular, if P(z) has degree n, then P(z) has precisely n roots (counted by their multiplicities). 
It is obvious that Corollary 1.3.1 implies Theorem 1.3.1. We shall prove that Theorem 1.3.1 also 
implies Corollary 1.3.1, so the two results are indeed equivalent. 


Proor. 1) Existence. Assume that the polynomial P(z) has degree n > 1. Then it follows from 
Theorem 1.3.1 that it has a root a1, thus P (a1) = 0. 


By a Taylor expansion from a1, cf. Appendix 5.3, we get 


“(a1 "(ay 2 (n) ay n 
P(2) = AED (e — an) + ISH (2 = ca)? + IY (2 — an)” = (2 a1) - le), 
where 
"(ay ” (ag (") (ay n-1 
P(2) = i Lies Po ee a | ) Q1) 


is a polynomial of degree n — 1. 
When we apply the same method on P;(z) we get similarly 
P,(z) = (z — a2) - Po(z), Le. P(z) = (2 — a1) (z — a2) Po(z), 
where P2(z) is a polynomial of degree n — 2. 
We proceed in this way, and after n steps we have obtained (9). 


2) Uniqueness. Assume that we have two representations of P(z), 


P(z) = a(z— a4) (2 — a2)-++(2 — On) = b(z — Bi) (2 — Bo) ++ (2 — Bm) » 
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where a £0 and 6b £0. 


The terms of highest order are az” and bz™, respectively, so it follows from Theorem 1.1.1, The 
identity theorem, than n = m and a= b. 


If we choose z = Qj, the left hand side becomes zero, P (a,) = 0. Hence, a; must also be a root of 
the right hand side, i.e. a; must be one of the n numbers /),....0,. Changing indices, if necessary, 
we may assume that a; = (1. 


When z # a1, it follows from a division by a(z— aj) that 


(z — ag) +++ (2 — An) = (2 — B2)++ (2 — Bn), z€C\ {ai}. 


Due to the continuity this equation also holds for z = a;. Then proceed as above, i.e. the root az on 
the left hand side must be one of the remaining numbers (2,...,8n, SO @2 = (G2 after another change 
of index, etc.. Continue in this way n times, until we get the triviality 1 = 1, and the corollary is 
proved. 
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The n numbers a1, Q2,...,Q@, of Corollary 1.3.1 are all roots of P(z). They need not be mutually 
different; some of them may be multiple roots. In some situations it is better to preserve (9), i.e. 


P(2) = a(z— ay) (z—a2)-+-(2- an), 


even if there are repetitions among the factors, but in most cases we prefer to collect identical factors, 
so 


(10) P(z) =a(z—a)"-+-(z-a,y)"", Mtr tn =n, 


where n,; € N is called the multiplicity of the root a;, and where qj,...,a, in (10) are the mutually 
different roots of P(z). 


Ifn; = 1, then the corresponding root a; is called a simple root. Ifn; > 1, we say that a; is a multiple 
root. In case of n1 = 2 we also call a; a double root. 


Finally, (9) is also true for constants 4 0, i.e. for polynomials of degree 0, because there is no factor 
of degree 1 in this case. This corresponds to the obvious fact that a constant polynomial 4 0 does 
not have any root. 


1.4 Vieti’s formulze 


Let a1,...,Q@n € C denote the n roots of the polynomial 


(11) P(z) = agz” + a,2"7* 4 +++ + Gn-1Z + Gn, a0,+++,4n ec, do # 0. 


Then also 
P(z) = a(z—ay1)(z-—a2)--:(z— an) 
(12) = aoz”—ag(ai t+ ag++->+a,) 27 1 + ap (Q1a2 + a103 +--+: + On—10n) 2" 


nee ere + (—1)"apaia2 8+ Mn. 


When we identify the coefficients of the two representations (11) and (12) of P(z), we get Vieti’s 
formule in the n complex variables a1, a2,...,Qn, 


bi = =-—{a,+ag+::-+an}, 
ao 
a2 

by = — = + {ayag + a103 + +++ + Qn_-10n} 
ao 
a3 

(13) bs = — = — {ayaga3 + a1aga4 + +++ + An—2An-10n}, 

ao 
an 

bp = — = (-1)" a1 02°++ An. 
ao 


The formule of (13) are also called the elementary s ymmetric polynomials in the n complex variables 
A1,--+,An.- 
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Using (13 we easily prove 


Theorem 1.4.1 Assume that the polynomial 

P(z) = 092" +921 + + + Gpizt Gn, do, @1,---,An ER, a £0, 
has real coefficients. If 
(14) a? — 2apaz < 0, 


then P(z) has complex, non-real roots. 


PROOF. It follows from (13) and (14) that 


= {—(a, +a9 +--+ an) = {ara +0103 +++) + On-1dp} = 0? + 28 ++ + 02. 


Since the sum of the squares of all roots is negative, they cannot all be real numbers. 


Remark 1.4.1 It follows actually from the proof that it suffices only to require that b; and b2 are 
real, and then of course (14). The remaining coefficients may be complex. > 


Example 1.4.1 Every polynomial of the special form 
P(z) = 002" +0-2" 1 + age” 7 + +++ + ap, ao, @2,.--, An ER, 


where a; = 0, and where ao and ag have the same sign, i.e. dga2 > 0, must necessarily have complex 
roots. This follows immediately from Theorem 1.4.1, because then 


a — 2agag = —2ana2 < 0. 
If in particular we choose ap = ag = 1 and a, = 0, then it follows that every polynomial of the form 
BP gt gee? aga, a3,..-,4dn EC, 


must have complex roots. One simple example is the well-known z? +1. > 


1.5 Rolle’s theorems. 


In this section we show some variants of the well-known Rolle’s theorem, when it is restricted to 
polynomials. We shall first prove the general r esult. 


Theorem 1.5.1 Rolle’s theorem. Assume that f(t) is a real continuous function defined in a closed, 
bounded interval [a,b]. Furthermore, assume that f is continuously differentiable in the interior in- 
terval ja, b[. If f(a) = f(b) =0, then there exists at least one point € €]a, bl, such that f’(€) =0. 
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Figure 2: Rolle’s theorem. 


PROOF. We shall give a proof which is very similar to the proof of Taylor’s formula in Appendix 5.3. 
It follows from f(a) = 0 that 


f(x) = f(a) +f fMd= i. f(tj)dt fora<a<b. 


We get in particular for xz = b, 


b 
(15) f(6) =0= | f(t = dt. 


If f’(t) =0 in [a,b], there is nothing to prove. 


If f’(t) is not identically 0, then f’(t) must have both positive and negative values in Ja, b[, since 
otherwise (15) could not be satisfied. By assumption, f’(t) is continuous in Ja, b[, so there must exist 
(at least) one € € Ja, b[, such that f’(€) = 0. 


We shall in the following choose f = P as a real polynomial (i.e. of real coefficients) of degree n in 
the real variable x € R, 


(16) P(e) = 09x" + ayx” 1 +++ + ap, do,---,4dn ER, ap #0, ER. 


Theorem 1.5.2 Let P(x) be gives as in (16), and let a < b be two succeeding real zeros of P(x). 
If the roots are counted according to their multiplicity, then the derivative P’(x) has always an odd 
number of zeros in the interval ja, b[. 


ProoF. We first assume that all roots of P’(x) are simple. Since P(x) does not have zeros in Ja, b[, 
we may assume that P(x) > 0 in Ja, b]. 


The zeros of P’(a) divide Ja, b| into open subintervals, in which P’(x) is alternatively positive and 
negative. Since P(x) > 0, we must have P’(x) > 0 in the subinterval, which has x = a as its left 
bound, and P’(x) < 0 for x in the subinterval, which has x = 6 as its right bound. 
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Figure 3: The sign of P’() in [a, 6). 


It follows that we have the following variation of the sign of P’(a), 
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We notice that between two successive plus signs there are always two zeros of P’(x). This implies 
that the final zero to the right cannot be paired with another zero. The sequence ends with +, 0, —, 
and the number of zeros of P’(a) must therefore be odd in this case. 


Then allow that P’(x) has a higher order of zero in the case of either +, 0, —, or —, 0, +. This implies 


that the order of the zero must necessarily be odd so in the count of the zeros we replace 1 by some 
odd number. This does not change the conclusion of the theorem. 


Finally, if the variation of sign of P’(a) is either +, 0, +, or —, 0, —, then the zero must be of even 
order. In this case we replace the variation +, 0, + by + alone, and the variation —, 0, — by — alone, 
where we in both cases add an even number to the number of zeros. This process will not change the 
conclusion either, and the theorem is proved. 


Theorem 1.5.3 Let P(x) be the polynomial (16), and let a < ( be two successive real zeros of the 
derivative P'(x). 


1) If P(a)- P(B) > 0, then P(x) has no zero in ja, GI. 
2) If P(a)- P(B) <0, then P(x) has precisely one zero in |a, (I. 


Proor. 1) Assume that P(a), P(G) > 0, and that there is a € € Ja, 6[, such that P(£) = 0, so we aim 
at getting to a contradiction. 


It follows from the assumption that there exists a y €]a,é[, such that P’(7) < 0, and a uw E]€, JI, 
such that P’(w) > 0. Since P’(x) is continuous, there also exists a v €]y,u[, such that P’(v) = 0. 
Then we have also v €]a, 3[, which contradicts the assumption that a and ( are successive zeros of 
P'(x). Hence, P(x) # 0 for every x € Ja, GI. 


2) Assume that P(a)- P(G) < 0, so P(a) and P(@) have different signs. The continuity of P(x) 
implies that there exists a zero € € Ja, {| for P(x), thus P(€) = 0. 


Assume that there exists another zero in the interval, e.g. 4 €]a,é[, such that P(w) = P(€) = 0. 
Then Theorem 1.5.2 implies that there exists another zero v €]y,€[ of P’(x). Since v €]a, f|, this 
contradicts the assumption that a and £ are two successive zeros of P’(x). 


Corollary 1.5.1 Let P(x) be the polynomial (16). If the derivative P’(x) has p complex roots (i.e. 
non-real roots), then the polynomial P(x) itself has at least p complex roots. 


ProoFr. The degree of P(x) is n, so P’(x) has degree n — 1, and P’(x) has by assumption n — 1 — p 
real roots. 


It follows from Theorem 1.5.2 and Theorem 1.5.3 that the polynomial P(a) has at most one extra 
real root, thus at most n — p real roots of P(x), and hence at least n — (n — p) = p complex roots of 
P(a). 


Corollary 1.5.2 Let P(x) be the polynomial (16). The number of complex roots of the derivatives 
P(x), 7 =0,1,...,n, is a weakly decreasing function in j. 
Here we have put P(x) := P(2). 
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ProoF. This follows immediately by successive applications of Corollary 1.5.1. 


Example 1.5.1 If we can find the roots of the derivative P’() of a polynomial of real coefficients, 
then Rolle’s theorems can be applied to find where the real roots of P(x) are situated on the real axis. 
This is always possible, if P(x) has degree 3, or if P(a) has degree 4, where the term of degree 3 is 
missing. The latter condition can always be obtained by using a translation, cf. Section 1.2.1. 


We shall illustrate this in the following by some examples. 


“|S 


1) Consider the polynomial P(x) = x? — 2x — 5. Then P’(x) = 3x? — 2, which has the roots + 


We find by insertion the following variation of sign of P(x), 


v6 V6 
g. 3 


v6 


from which we conclude that there is only one real root and that it is > as 


2) The polynomial P(x) = 2° + x? — 5x +3 has the derivative P’(x) = 3x? + 2” — 5, which has the 
5 
roots = and +1. We get by insertion the following variation of sign of P(x). 


1 + +00 


0 =F 


5 
from which follows that x = 1 is a double root and that there also is a real root < —=. This is 


easy to find by Vieti’s formulae, because the sum of the roots, 1+1+a, must be —a; = —1, hence 
a= -—-3. 


w 
we 


Finally, let P(x) = «+ + 12x? + 96x — 12. Then 
a? — 2aja2 = —24 < 0, 


so it follows from Theorem 1.4.1 that we have at least two complex roots. Since P(0) = —12 < 0 
and P(x) — +00 for « — +oo, we must also have at least two real roots. Finally, the total number 
of roots is 4 by the Fundamental theorem of algebra, so we conclude that we have two real and two 
complex roots. 
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2 Some solution formulz of roots of polynomials 


There are very few exact solution formulee of a polynomial equation P(z) = 0. The reason is of course 
Niels Henrik Abel’s result that P(z) = 0 in general cannot be solved by root signs, if deg P > 5. (It 
may of course occasionally be solvable). We shall in this chapter give the exact solution formule in 
the cases of the binomial equation and the equation of second degree. 


There exist exact solution formule: for equations of third and fourth degree, but these are absolutely 
not of any reasonable computational value, so although they are classical, we shall not give them here. 


Finally, we give some useful partial results, assuming either that we have a rational root or a multiple 
root. 


2.1 The binomial equation. 


The simplest possible non-trivial polynomial equation is the binomial equation in polar coordinates, 
2" =a=r-exp(i{O + 2pz}), r>OandpeZ. 


Its n roots are given by 
0+2 0+2 0+2 
(17) 2= Frexp(i2 7") — yr fcos(S PPT) 4 i-sin( SET) b p=0,1,...,n—1. 
n n n 


That everyone of the n numbers of (17) are roots, follows by insertion. That they are mutually 
2 

different for r > 0 follows from the fact that they all lie on a circle of radius ~/r with the angle au 
n 


between two adjacent roots, cf. Figure 4. 


Figure 4: The six roots of 2® = —-1. 


Finally, it follows from the Fundamental theorem of algebra, cf. Corollary 1.3.1, that the equation has 
precisely n roots. 


Ww 
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The geometry of (17), cf. Figure 4, can be exploited in the following way: Rewrite (17) as follows, 


ie) 2i 2i 
z= vF-exp(ie) -exo( 7) = #9 exp( Pr), p=0,1,....»-1, 
n n n 


(2) 
where 29 = */r-exp ( =) is anyone of the n possible solutions. Then the other roots are found, when 
n 


we successively multiply zo by 


(*) (=) os (=) 
exp = cos +2-sin - 
n n 


1,2,...,n—1 times. 
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Example 2.1.1 We shall solve the binomial equation 


= 2-2. 
Figure 5: The three roots of z? = —2 — 2i. 
Since |a| = | — 2 — 2i] = 2V2 = {V/ ae it follows that all three roots lie on a circle of centre 0 and 
radius /2. 


3 
Then it follows from 0 = Arg a = Arg(—2 — 27) = st that one of the solutions is given by 


A= v2 exp(-i “) = v2 {cos(-“) +74 sin(—7) } =1-i. 


4 4 
bx : . ; bs Gi aps 271 
The remaining two roots are either found geometrically, cf. Figure 5, or by multiplication by exp ce 
po 
and exp (=) , Tesp., thus 
_ Qi ~(a—i) 1, .v3|_ v3-1, .v3+1 
Z2 = z1 exp 3 = a gy = 5 + 4 5? 
and 
a ee v3 V3+1  .V3-1 és 
= = | => 1 a 
23 = 21 exp 3 a 5 a 5 5 5 
: : ‘ , 27 . (20 
It is customary in the solutions to give the exact values of cos | — } and sin = forn = 2, 3, 4, 6, 8, 12, 
nm 


cf. Table 1, where we have also added n = 5 and n = 10 for completeness. It should be mentioned 
that such exact expressions using square roots do not exist for n = 7,9, 11, 18, 14, but again for 
n = 15, 16, 17, where, however, they are too complicated to have any practical use. 
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2 | cos(7) =—-1 sin(t) =O 
3 cos( =) — ! sin( =) = v3 
= 2 o 2 
4 cos( 4 = 0 - sin(> =1 
5 cos( =) = sie sin (=) = ee 
1 3 
6 cos (7) = i, sin() = 2 
W . (Ww 
8 cos (“) = a sin(7) = 
10 cos(2) _ _ sin(") _ v10 : 2/5 
12 cos (7) = ve sin(7) = ; 


on rormulee 


2 2 
Table 1: Table of some exact values of cos (=) and sin (=) ; 
n n 


of roots of polynon 


If in particular the exponent is n = 2 in the binomial equation, then it is possible to give exact solution 
formule in the rectangular coordinates without using polar coordinates. 


Theorem 2.1.1 Given the equation 


2 


1) If 8 >0, 


2) If 8 <0, 


3) If 8=0, 


z= (¢£+iy) =a=a+if. 


then the solutions are 


then the solutions are 


i a 
2 2 


; i of +P +0 cae 
= 2 2 


then the solutions are 


In all three cases we define the square root of a positive number as a positive number. 
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ProoF. It suffices to prove 1), because 2) and 3) are proved similarly. We shall clearly only check 
the candidates of the solutions. The plus/minus sign gives us two possible solutions, and a squaring 
finally gives 


" i ae 
7 2 2 


_VC+RP+a  Yart+h-a_,. Cae in ee 
= 5 5 + Qi i =a+iB=a, 


and 1) is proved. 


2 


Example 2.1.2 In practice, Theorem 2.1.1 rarely gives “nice” solutions, though it occurs in special 
cases. Clearly, the equation 


2=384+4 has the solutions + (2 +14), 


and the equation 


27 =5+12i has the solutions + (3 + 2i). 


In general, the formule: of Theorem 2.1.1 become messy. © 


2.2 The equation of second degree. 

The usual solution formula of the polynomial equation of second degree with real coefficients is still 
valid, when the coefficients are complex. The only modification is that we shall choose one of the 
two possibilities of the square root b? — 4ac, which is one of the solutions of the binomial equation 
z? = b? — 4ac of degree 2. 

Theorem 2.2.1 The solutions of the polynomial equation of second degree 


azr+bz+c=0, 


where a € C \ {0}, and b, cE C are constants, are given by 


fis) = {-b4 JP 4ac} 


Proor. Let 62 — 4ac denote one of the two solutions of the binomial equation w? = b? — 4ac. The 
number of candidates of the solutions is two, so it suffices to check the candidates in the original 
equation. A rearrangement of (18) gives 


b Vb? — 4ac 


Zz =x ; 


2a 2a 


hence, by a squaring, 


b b? b? — dac _ b? 
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which is reduced to 


b c 
2+—2+-=0. 
a a 


Finally, we multiply by a to get the original equation. 


Example 2.2.1 The equation z? — 2z — 2 — 4i = 0 has the solutions 


= 5 {24 4 — 4(—2 — 47) =1t VIF @4H) = 14 VIFH=14 24H) ={ Pal 


where we have used that +/3 + 4i = (2 +72), cf. Example 2.1.2. A check shows that 


a, +ag=2=-a, and ayaz = —3+1-41 = -—2-4i = ap. © 
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2.3. Rational roots. 


If all coefficients ao,...,@n € Q of the polynomial P(z) are rational numbers, then there is a limited 
set, which is easy to find, of possible rational roots, where we just have to check each one to see, if it 
indeed is a root of P(z). first notice that we showed in Section 1.2.2 that if P(z) has only rational 
coefficients, then we could find an equivalent polynomial with only integer coefficients, and even obtain 
that bo) = 1. We shall not need this stronger result here, so in the following it is sufficient to assume 
that all coefficients are integers. 


We introduce the following notation. Assume that p and q € Z, where q # 0. We say that q is a 
divisor of p and write q|p, if there is an r € Z, such that 


p=q'r. 


Theorem 2.3.1 Assume that the polynomial 
P(z) = ap2" +a:2 1 + +++ + Gn_izt Gn, ao, Q1,.--, An € Z, 


has integer coefficients, where ag # 0. 
Assume that z = — € Q is a rational root of P(z), where p € Z and q € N do not have other common 


q 
divisors from Z than +1. Then 


pian and qlao. 


PROOF. We assume that P (£) = 0, where p € Z and q € N do not have other common divisors than 


qd 
+1. Then 


vy" a = agp" + ap" !q+++++an-1pq" | + ang", 


hence by some rearrangements, 


p{aop”* + ayp"-7q +--+ + an_ig™ *} = —-anq”, 
and 
q{aip"—* + agp" 7q +--+ + ang”*} = —aop”. 
Since do,..-,@n, Pp, g € Z, and p and q have only the trivial common divisors, it follows from the first 


equation that pla, and from the second one that q|ao. 


In practice Theorem 2.3.1 is applied in the following way. Assume that the polynomial equation 
P(z) = 0 has only integer coefficients. Let {qi,..., qe} be all mutually different (positive) divisors in 
ag, and let {pi,..., px} be all mutually different (positive) divisors in a,. Then the possible rational 
roots (if any) must belong to the set 


{+e f= 1 phand Jay uyth, 
qj 


Finally, we check all these at most 2k¢ possibilities. 
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Example 2.3.1 We shall solve the equation 
P(g) = 23+37-4=0 


of integer coefficients. First notice that since a; = 0 and ap - a2 > 0, it follows from Example 1.4.1 
that we must have two complex conjugated roots, and there is precisely one real root. If this root is 
rational, it must be one of the elements of the set 


{21,22,44}, 
because ap = 1 and a, = —4. It follows by inspection that z = 1 is a root. Then we get by a division, 
P(z) = 2 4+32-4= (2-1) (2? +244). 


Solving 2? + z+ 4=0 we get the remaining two roots. Summing up, the three roots are 


24 =1, f= ——-+i— and z=-—~—-—i-—_. 


The equation is of third degree, so it could in principle be solved by Cardano’s formula, which has 
been omitted here. We shall here without details show why this is not done. In fact, if we instead of 
the above apply Cardano’s formula, then we get after some very long and tedious computations that 
the three roots are given by 


ga A/a da/s 4 /o- V5, 


__f.1. ls | 
a= {-pro9t De Vi4) 3-15 
a={ : Bhi V5 Bh if V5. 


It is far from obvious that {21, 22, 23} and {21, 22, 23} describe the same set of points. 


= 
a 
—) 
wo 
i) 
a 


i) 
i) 


Since solutions by Cardano’s formula usually have the complicated structure of 21, Z2, 23 above, we 
have decided here not to bring Cardano’s formula to avoid that the reader would be tempted to use 


it. © 


Remark 2.3.1 Theorem 2.3.1 does not assure that a polynomial of integer coefficients has rational 
roots. In case of P(z) = z? +1 we have ag = 1, ay = 0 and az = 1, so the candidates of rational roots 
are +1. However, none of these is a root, the roots being the complex numbers +7. 


Remark 2.3.2 Assume that 
P(z) = aoz” +42" | +-+-+an_1z+ Gn, a0, Q1,---,4n EC, ap #0. 
Assume furthermore that every coefficient a; € C has rational real and imaginary parts, 


a; =a; +if;, aj, 8; €Q 705 do og 
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Then the method above of finding rational roots zz, € Q still applies. In fact, if 


P(4) =o, pEZandqeN, 


where p and gq only have the common factors +1, then we get by splitting q” P(®) = 0 into the real 
qd 


and imaginary parts that 

aop” + arp" —*q +++» + On—1pq"* + ang” = 0, 
and 

Bop” + Bip" *q+-++++ Bn—1pq"* + Brg” = 0. 


Multiplying by some constant from N we may assume that ag,...,@n, and ,..., Bn are all integers, 
so it follows from Theorem 2.3.1 that 


pla and p| Bn as well as qlao and q|Qo. 


Then it is easy to find all candidates of a rational root and then check it in the original equation. © 
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Remark 2.3.3 Whenever the task is to find the roots of a polynomial it will always be a good strategy 
first to check if the methods of this section apply. 0 


Example 2.3.2 We shall find all roots — if possible — of the polynomial 


P(x) = 2§ — 10x + 4024 — 822° + 91a? — 527 +12 


of integer coefficients. 


-0.64 


Figure 6: The graph of P(x), which suggests that 1, 2, 3 are roots, though this is not a proof in itself. 


Here, a9 = 1 and ag = 12 = 1-2-2-3,s0 by Theorem 2.3.1 the candidates of possible rational roots 
are 


1 


2 


peo, Sey aad eR 9, 

Then notice that P(x) > 0 for x < 0, so every real root must be positive. This leaves us the possibilities 
1,2) 3,4, 612. 

We get by insertion, 
P(1)=0, P(2)=0, P(3)=0, P(4)=108, P(6)=6,000, P(12) = 1,197,900, 

so z = 1, 2, 3 are roots, and 


(2 — 1)(a — 2)(a — 3) = 2° — 627 + lle —6 


is a divisor in P(a), and we get by a division, 


P(x) = (x — 1)(x — 2)(a — 8) (@* — 42 + 5a — 2). 


The candidates of rational roots of the latter factor are x = 1, 2. Notice that they necessarily must 
be included in the previous set. We get by insertion, 


13—4-174+5-1-2=0 and 2°—-4-274+5-2—-2=8-164+10—-2=0, 
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so both x = 1 and x = 2 are roots in the latter factor. A division by («—1)(2—2) = x? —32+2 gives 
P(a) = (x—1)(a—2)(a —8) (a — 42? + 5a — 2) 


(x — 2)(a 
= («—1)(@—- 2)(@— 8){(— 1)(@— 2): (@—1)} 
= (—1)*(x—2)*(«— 3), 


1 
1 


and the six roots are 


i, i. a. O.. oe e 


This example has been chosen as simple as possible. In general the computations are not that easy. 


At the same time it is illustrated that we get more information of P(a) in the factorized form 


than in the original form 


P(x) = 2® — 102° + 40x* — 8223 + 91a? — 52x 4 12, 


so a rule of thumb is to keep a factorization of a polynomial as long as possible. > 


Example 2.3.3 We shall find all roots of the polynomial 


P(x) = 2° + 22 — 243 + 247 — 3a 


of integer coefficients. 


—~L_/ 


Figure 7: The graph of P(x), which suggests that 0, 1, —3 are roots. 


Obviously, x = 0 is a root, and we have P(x) = x- F(x), where 
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is a polynomial of integer coefficients. 


Here, ag = 1 and ag = 8, so it follows from Theorem 2.3.1 that the candidates of possible rational 
roots are +1, +3. We get by insertion, 


F(A) =142-049=320, Fp tS 299 9, 
F(3) = 81054.18+ 6-3 40, F(—3) = 81—54-18-6-—3=0, 
and we have proved, what was indicated on Figure 7 that x = 0, # = 1 and x = —3 are roots of P(z). 


A division by (a — 1)(a +3) = x? + 2x —3 gives the following factorization of P(z), 


P(x) =a- F(a) =a: (a? + 2¢ — 8) (a? +1) = a(x -1)(2 + 3)(e—-1)(x +4), 


and the five complex roots are 


2.4 The Euclidean algorithm. 
We shall here shortly describe how we divide a polynomial 

P(z) = agz" +a,2"-14+--++ aq, ag,.--,4n €C, ag # 0, 
by another one, 

Q(z) = boz™ + diz ++ 0m, 005-2, bm EC, = F,~, 


with remainder term, where we assume that m < n. Usually even m < n in this division algorithm, 
and the remainder term is a polynomial R(z) of degree < deg Q =m. 


If one does not use a computer, the best way is to use a so-called “gallows construction” , 


Q(@) Plz) 


For the given polynomials above the construction starts in the following way, 


bo2z™ + by z+ ++ + bn |agz™ + az + + +++ + ap] (ag/bo) 22—™ 


b 
(ao/bo) 2" - Q(z) = az” + ag- - grt eee 
0 
: : bi\ n-4 
Subtraction gives: a1 —ag- 7 Jz fee, 
0 


and then proceed similarly, until the bottom polynomial (the remainder term) has lower degree than 
Q(z). Since the degree is lowered by at least 1 at each step, this construction contains at most n—m+1 
steps. 
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Example 2.4.1 We shall find a and 6 such that 2? + 2+ 1 is a divisor of «+ + 32° +52? + ax 4b. 
It follows by the division algorithm that 
a? +o + 1 ot + 327 + 5a? + on + blz? + 224+ 2 
vit att a+ 


Qn? + 4? + ax +b 

Qe + 2x? + Qa 
Qn? + (a—2)e+h 
Qu? + Qn +2 
(a—4)a+b-—2 


If z? + 2+ 1 is a divisor, then the remainder term must be 0, thus a = 4 and b = 2, and we get 
at + 3a° +507 +4+42+2= (a7 +a+41) (a? + 2a + 2). 


The roots are 


WHILE YOU WERE SLEEPING... 


DUKE 


THE FUQUA 
SCHOOL 
www.fuqua.duke.edu/whileyouweresleeping OF BUSINESS 
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It is easy to understand the principle of the division algorithm above. However, if the coefficients 
are not integers, the computations become usually very hard and tedious. In some cases one is only 
interested in a constant times the remainder and not in the quotient itself. If the coefficients are 
integers, then the third line in the gallows will usually have rational coefficients and not integers. 
Multiply this third line by a constant, such that this new fourth line has integers as coefficients and 
proceed in this way. Of course, in this case it does not make sense to indicate the quotient. Only the 
remainder term times a convenient constant is here of interest. 


In theoretical considerations one argues on the pure division algorithm as in the Euclidean algorithm 
described in the following. 


Let P,(z) and P2(z) be two polynomials, where deg Py < deg P,. Then by the division algorithm we 
get a unique quotient Q(z) and a unique remainder term P3(z), such that 


P,(z) = Po(z)- Qi(z) + Ps(z), deg P3 < deg Po. 
Then repeat this process with P2(z) and P3(z), thus 
P2(z) = P3(z) - Qo(z) + Pa(z), deg Py < deg P3, 
where Qo(z) and P4(z) again are uniquely determined polynomials. 


Proceed in this way, until we obtain an equation, in which the remainder is the zero polynomial. Thus, 


Pi = P2Qi + Ps, deg P; < deg Po, 
P2 = P3Q2 + Pa, deg Py < deg Ps, 
Pm-2 = Pm-1Qm-2 + Pm, deg Pm < deg Pr—1, 


Prt = PrQm-1- 


It follows from the first line of (19) that a common divisor of P; and P must also be a divisor of P3. 
Then the second line of (19) implies that this common divisor is also a divisor of Py, etc., so it must 
be a divisor of Py. 


Conversely, it follows immediately from (19) that Py,|Pm-—1, hence also Py,|Pm—2, etc. so Py, must be 
a divisor in both P; and Py. 


Hence we have proved 
Theorem 2.4.1 Given two polynomials P\(z) and P2(z). There exists precisely one normalized poly- 
nomial D(z), t.e. the coefficient of the term of highest degree in D(z) is 1, such that all common 


divisors of P,(z) and P2(z) are precisely all divisors of D(z). 
We call D(z) the greatest common divisor of P,(z) and P2(z), and we denote it by 


D =(P,, Po). 


One usually finds D(z) = Pn(z) by means of the Euclidean algorithm (19) above. 


Remark 2.4.1 If we are only interested in the remainder polynomials P; of (19) and all coefficients 
are rational, it may be convenient to apply the modified division algorithm described after Exam- 
ple 2.4.1. 
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Corollary 2.4.1 Let (P,,P2) = D. If Q is a normalized polynomial, then 


(P,-Q,P2-Q)=D-Q. 


ProoF. This follows from the fact that the Euclidean algorithm for P,Q and P2Q is obtained from 
the Euclidean algorithm for P, and P2 by multiplying every division equation by Q. 


Example 2.4.2 We shall find the greatest common divisor of the two polynomials 


e* — 32° +527 +2—4 and go ie? —Re? be" = 4g — 1; 


This example will demonstrate that without a computer the Euclidean algorithm gives some very 
tough and tedious computation to carry out by hand. We shall therefore not follow (19) strictly, but 
use some shortcuts, whenever possible. 


The problem can in fact be reduced, if we start by checking the possible rational roots +1. Of these 
only « = 1 is a root of the latter polynomial, and x = 1 is also a root of the former polynomial, we 
may reduce the problem considerably by a division by x — 1. 


We shall in the following assume that we have not noticed that x = 1 is a common root of the two 
polynomials. Then by the division algorithm, 
a* — 32° + 5a? +a —4|2° + 72* — 82° + 5a? —4e 1|2 +10 
a” — 304 +503 x? — Ag 
1024 — 132° + 4x? -1 
10x* — 30x + 50x? + 10x — 40 
172° — 467 — 10x + 39 


Then multiply the first divisor 2* — 32° + 52? + 2 — 4 by 17 and divide the result by the remainder 
term 17x? — 462? — 102 + 39, 


172° — 46a? — 102 + 39|172* — 512° + 8527 + 17x — 68|x 
1724 — 462° — 102? + 39 
5a? + 95a? — 22a — 68 


The next division is not nice, so we use the modified algorithm, multiplying the remainder by —17, 
before we proceed, 


172° — 46a? — 102 + 39|852° — 161527 + 374@ + 1156|5 


85a — 23027 — 5Oa+ 195 
—1385277 + 4247+ 961 


We change the sign of the remainder term, and the theory then tells us that we shall divide 13852? — 
4242 — 961 into some multiple of 17z° — 46x? — 102 +39, so the factor should be chosen as 1385. This 
does not look too nice, so instead we notice that 13852? — 424% — 961 is a polynomial of second degree 
with a known solution formula. The roots are 

961 


1 Seas 
oud 1385’ 
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sO 
138522 — 4242 — 961 = (x — 1)(13852 + 961). 


It follows by insertion that 2 = 1 is also a root of 172° — 462? — 10x +39, and it follows by the division 
algorithm that 


a — 1|17% — 46a? — 10x + 39172? — 29% — 39 
172° — 172%? 
—29x7 — 10x + 39 
~29a7 + 29x 
—39x + 39 
—392 + 39 
0 


where the remainder is 0, so 
172° — 4627 — 10x + 39 = (a — 1) (17x? — 292 — 39) . 
The possible rational roots of 17x? — 29x — 39 are 


1 3 13 39 
Le iW a 17’ lt 


1, +3, +13, +39, 


961 
Clearly, none of these is equal to ————, so the only common root is x = 1, and the largest common 


-_ : 1385’ 
divisor is 


D(a) =a2-1. ?) 


2.5 Roots of multiplicity > 1. 
It is sometimes possible by applying the Euclidean algorithm to find the roots of multiple multiplicity. 


Theorem 2.5.1 Given a polynomial P(z). The roots of multiple degree of P(z) are the roots of the 
greatest common divisor Dz = (P,P’) of the polynomial P(z) and its derivative P’(z). Each of the 
roots of Dz has a multiplicity which is 1 smaller than its multiplicity in the original polynomial P(z). 


PROOF. Write 


(20) P(z) =a(z—a1)"?---(z—-—a,)”", Pi;-++Pr 21, 
where a1,...,@, denote the r mutually different roots. Then 
P'(z) = a-p (z—a1)™ * (2 — a0)” -+- (2 — a)” 
Lo pmiee) Gag" Ga)" 
Lot. ei) rea eee 
(21) = a(z—ay)P"*---(2— ay)? {pi (2 — ag) -+- (2 — Op) +++ +p (Z— 01) ++ (2 — pa) J - 
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If we put z = a; into the latter factor of (21), all terms disappear with the exception of 
(22) Pj (a, — a1) S01 38 (a; = a;-1) : (a; = A541) S2hiss (a; _ Qy) # 0. 
It follows from (20) and (22) that 


Dg = (P, P’) = (z—04)"!"* ++» (z—a,)?”* 


where we of course remove all factors where the exponent is p; = 1. 


Example 2.5.1 We shall find all n € N, for which the polynomial (z + 1)"*! + z”*+! +1 has roots of 
multiplicity > 1, i.e. we shall find n € N, such that 


P(z)=(24+ 1)" +27 41 and P'(z) = (n+1){(2+ 1)" 4+ 2" 


have common roots, i.e. the greatest common divisor D(z) = (P, P’) is a polynomial of degree > 1. 


In the past four years we have drilled 


31,000 km 


That's more than twice around the world. 


Who are we? 

We are the world’s leading oilfield services company. Working 
globally—often in remote and challenging locations—we invent, 
design, engineer, manufacture, apply, and maintain technology 

to help customers find and produce oil and gas safely. 


Who are we looking for? 
We offer countless opportunities in the following domains: 
= Engineering, Research, and Operations 


= Commercial and Business 


If you are a self-motivated graduate looking for a dynamic career, 


apply to join our team. What will you be? 


careers.slb.com Schlumberger 
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Using the Euclidean algorithm we get 
(g+1)"+ 2"|(2+1)-(¢4+1)"4 z-2"4+1|z+1 
(2+1)-(@4+1)"4+ (e411): 2” 


=2" 41 


so the task is reduced to find common roots of the polynomials (z + 1)"+ 2" and z” — 1. This is of 
course equivalent to find the common roots of 


{(g41)"4+ 27} -— {2-1} =(¢41)"41 and 2” — 1. 


The roots of (z+ 1)” + 1 lie on a circle of centre —1 and radius 1. 
The roots of z” — 1 lie on a circle of centre 0 and radius 1. 


Figure 8: The roots lie on both circles. 


The only possibilities are, cf. Figure 8, 


1 . V3 Qin 
ae 5 = exp sae ae 


It only remains to find n € N, such that these two numbers are roots in both (z+ 1)” and z” — 1 
Notice that if 


1 3 2% 1 3 
= 541 G =en(2"), then ot i= gti = cxn(+) 


a a 
with corresponding signs. Hence, 


2int 


2 = exp( 3 )=1 forn=3p, peEN, 


and 


(20 +1)" = exp( 54) =-1 for n = 3(2p+ 1),p EN, 
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and we conclude that the possible exponents must have the structure 
(23) n = 3(2p + 1), pENo. 
Then by insertion, 
P (29) = (2 $1)"*1 + 2414.1 =-(a +1) +24+1=0, 
and 


P'(z) = (n+ 1) {(2o4+1)" + 26} = (nt 1I{-141} =0, 


thus it follows for the given exponents (23) that 


1 v3 ( a=) 
Z = -—= 1t— =exp(r+—— 


2 2 3 


are indeed roots of multiplicity > 1, and there is no other possibility. 


Finally, it follows from 


Pit te in { (zo Layee ao = (n4 1ynf exn(+) +exn(= 5 )} 


= instyn| (FZ) ( rite) - (n+1)n40 


that the multiplicity is 2 in both cases. © 


Returning to Theorem 2.5.1 it follows that we can repeat the process on Dz. This means that the 
roots of D3 := (Dz, D4) are the roots of P(z) of at least multiplicity 3, and their multiplicities in D3 
are their multiplicities in P(z) minus 2. If D3 is not a constant, then proceed with D4 := (D3, D3), 
etc. 


Summing up, we get a sequence of polynomials of decreasing degrees, 


D, =P all roots of P, 
D2 = (P, P’) all multiple roots of P, 

(24) Ds = (D2, DS) all roots of P of at least multiplicity 3, 
D; = (Dj-1, Di_1) all roots of P of at least multiplicity j. 


Obviously, this process stops after a finite number of steps. 


We immediately get from the above 


Theorem 2.5.2 Let the D; be defined by (24). If Dj41(z) # 0, then the roots of the quotient 
D,(z)/Dj+41() are all simple. They are the roots of P(z) of precisely multiplicity 7. 


Theorem 2.5.2 is convenient in the sense that the multiple roots of P(z) of multiplicity j are the simple 
roots of the simple polynomial D;/Dj+1, where one could hope for more efficient solution methods, 
because the degree of D;/Dj+1 is smaller. The disadvantage is of course that it cannot be used, when 
P(z) has only simple roots, because then Dz = (P, P’) = 1. 
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Example 2.5.2 Given the polynomial 


P(z) = 28 — 32° +724 — 1023 + 82? — 5242. 


We shall factorize P(z) in the following way, 
P(z) = Gi(z) - G3(z)---Gin(2), 


where G,(z) is the product of all factors z—a,, corresponding to the roots of multiplicity g, and where 
m is the highest multiplicity of roots in P(z). 


We first check for possible rational roots, cf. Section 2.3. These can only be one of the numbers +1 
and +2. Since the terms of P(z) have alternating signs, —1 and —2 are not possible roots. Then we 
get by insertion, 


P(1) =1-3+7-10+8-54+1=0, 


and 


P(2) = 64—3-32+7-16—10-8+8-4—5-24+2=8+36- 1042 = 36. 
We conclude that z = 1 is a root. 


Since 


P'(z) = 62° — 152* + 2823 — 3027 +16z—-5, and P*(1)=0, 


P"(z) = 30z* — 60z° + 8427—60z+16, and P”(1)=1040, 


we conclude that z = 1 is a root of multiplicity 2, and (z — 1)? = z? — 2z +1 must be a divisor in 
P(z). We get by division, 


z= Qe-+1\2° — 325 + 72* — 1029 + 827 — Se + 2/2* — 29 + 42? — 242 


28 — 2254 24 


227 —42 +2 
227 —42 +2 
0 


We conclude that 


(25) P(z) = 2° — 32° + 724 — 1023 +. 827 —5z4+2=(z-1)? {34 B+47—z42h, 
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so the investigation is then reduced to the polynomial 


P,(z) =2* — 28 + 42" —242 


, where Pi(z) =4z3 —327+8z-1. 
We divide P/(z) into 16P;(z), 


4z3 — 327 + 8z — 1|162z* — 162° + 642? — 16z + 32/42 -1 


1624 — 1223 +3222 — 4z 


Az? + 3227 — 122 + 32 
—43 4+ 327- 824+ 1 
2927 — 42431 
The remainder 20z2 — 4z + 31 has the roots 
2+1/895 
29” 


———= 


Va 


(nace) 


ACCREDITED 
w J 


CLICK HERE 


to discover why both socially 
and academically the University 
of Groningen is one of the best 
places for a student to be 
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which by insertion are seen not to be roots of 423 — 327 + 8z—1. We therefore conclude that (25) is 
the factorization with 


Gi(z) = 2° — 2? +427 -—2+2 and Go(z) =z-1. 


The simple roots (two pairs of pairwise complex conjugated roots) of G';(z) cannot be found with the 
methods known so far in this investigation. © 


Example 2.5.3 Let P(z) be a polynomial of rational coefficients. Assume that a is the only root of 
multiplicity v. We shall prove that q@ is rational. 


From Theorem 2.5.2 follows that we can write 
P(2) = Gi(z)G3(z) --- Gi (2), 

where by assumption, 
GY(z) =(z—-a)", 


and where in general, G;(z) has the simple roots which are precisely the roots of P(z) of multiplicity 
j. We get by the Euclidean algorithm, 


dD, = (P, P’) — G2G3-- Cae . aa 
Dy = (yD) = GsGg--GE PGR, 
Dog = Wy) SS tee, 

Dp => Dy-1, Diy it Gy4i Sassi Gr, 

Dy-1 = Gn. 


Since rational coefficients are preserved by the Euclidean algorithm, G,, must have rational coefficients. 
Then also 


Py(z) = P(z)/Gh(z) = Gi(z)G@3(z)--- Ga1(2) 


must have rational coefficients, and we can repeat the procedure from the very beginning of this 
example on P;(z). The conclusion is, that G,—1(z) must have rational coefficients, etc., so every 
G;(z) must have rational coefficients. For 7 = v we get G,(z) = z— a, because no other root has 
multiplicity v. We therefore conclude that a € Q is rational. > 


Example 2.5.4 We shall reconsider the polynomial 


P(z) = z® — 102° + 402* — 8223 + 912? — 522412 


of Example 2.3.2. This time we shall find its roots by using Theorem 2.5.2 instead. 
We get 


P'(z) = 62° —50z* +1602? — 2462? + 1822 — 52 
= 2{32z° — 2524 + 802° — 12327 + 91z — 26}. 
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If we divide P(z) by P’(z) we get a quotient of the form Qo(z) = az+ ( and a remainder term. This 
shows that it would be better to divide 


3°P(z) = 92° — 90z° + 36024 — 738z3 + 819z? — 468z + 108 


by 


P'(z) = 32° — 2524 + 8023 — 1232? + 91z— 26. 


At this step we can leave the division to the reader for exercise. The result becomes 


3? P(z) = (3z—5)- ; P'(z) — {524 — 3123 + 692? — 65z + 22}. 
Then we divide 

5? ; P'(z) = 752° — 6252z* — 2000z3 — 307527 + 22752 — 650 
by 


Ro(z) = 524 — 31z° + 692? — 652 + 22. 


The result is 


2 = 
* P'(2) = (15z—32)Ro(z) — 27z° + 1082” — 135z + 54 


= (15z—32)Ro(z) — 27 {28 — 427 + 52-2}. 
Then divide Ro(z) above by the modified remainder 
Ry(z) = 2 — 427 +52z-2 
to get 
Ro(z) = (5z — 11) Ri (2), 
so we conclude that the greatest common divisor is 


D,(z) = (P(z), P'(z)) = Ri(z) = 28 — 427 +52 —-2. 


We check if D,(z) has multiple roots. First, 
Di (z) = 3z7 — 8z +5, 
so we get by a polynomial division, 
3? D,(z) = (32 — 4) Dj (2) — 2(2-), 
where the remainder is Ro(z) = z — 1. Then finally, 


Di (2) = (32 — 5) Ra(2), 
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and we conclude that 
Da(z) = (Di(z), Di(z)) = 2-1. 
Summing up, we have proved that 


P(z) 26 — 1025 + 4024 — 8223 + 91z? — 52z +12 


I 


(26) ¢ Dy(z) 22 — 427 + 52-2 


I 


Dog(z) = z-1. 


It follows from Do(z) = z—1 that z = 1 is a root of multiplicity 3. By a division, 


P 
@) = S12 4162412 

eal 
and 

Di) _, 9 

(z—1)° 
so z = 2 must be a root of multiplicity 2 of P(z). 
Finally 

P 
(z) See, 


(2-152 — 2p 
and we conclude that 


P(z) = (z —1)3(z — 2)?(z —3). 


An alternative method is to use (26) to get 
P(z) 3 2 
= 2° —62° + 112-6, 
Dy (2) 


where the simple roots of this quotient are all mutually different roots of P(z). Furthermore, 


D,(z) 2 
=2Z 3z+2= (z-1)(z—- 2), 
Baa (2-1-2) 
so z=1 and z = 2 are the roots of multiplicity > 2. Finally, 


Pz) Dil2) _ P(2)- Dalz) _ 
D,(z) : Dog D,(z)? 


3, 


where we have removed all factors of higher multiplicity. Hence, z = 3 is the only simple root, and 
(z — 1)?(z — 2)?(z — 3) must be a factor of P(z). It follows again from the division 
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3 Position of roots of polynomials in the complex plane 


3.1 Complex roots of a real polynomial. 


Recall that a polynomial is called real if all its coefficients are real. The following well-known theorem 
is here included for completeness. 


Theorem 3.1.1 Let P(z) be a real polynomial. If a+i8, 840, is a complex root, then the complex 
conjugated a — i is also a complex root. In particular, 


(z—(a+i8))(z — (a —i8)) = (z—0)? +B = 2? —2az+ 07 + B?, 
is a divisor of P(z) of real coefficients, so 
P(z) = (27 — 2az + a? + 87) Q(z), 


where Q(z) is also a real polynomial. 


www.alcatel-lucent.com/careers 


~4 — 
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future and 
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future? 
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Proor. Assume that 

P(z) = az" +a y2" 1 + > + agaigt+ an, ao, 1, .--, A, ER, 
has the complex root a + 13, where a, 3 € R and 6 4 0. Then 

P(a + if) = ap(a+ iB)” + ay(a@ +18)" +++ + Qn—-1(a +78) + an = 0. 
Since the a; are all real, it follows by complex conjugation that also 

P(a — if) = ap(a — iB)” + a, (a —i8)"" +++» + an_1(a — 18) + a, =0=0, 


hence a — iG is also a root of P(z), and since 3 4 0, we have a—i3 4a+i{, so 


(z—(a@+46))(z — (a —79)) = 27 — 207+ 07 + 9? 


is a divisor of P(z) of real coefficients, so the quotient Q(z) is also a real polynomial, and 
deg Q = deg P — 2. 


If we argue similarly on Q(z), it follows immediately that if a +7@ is a root of order 7, then a — iG is 
a root of the same order 7. 


We mention in this connection a similar theorem with square roots instead of the imaginary “i”, so 
such a conclusion of a “twin solution” is not restricted to the complex case alone. 
Theorem 3.1.2 Assume that all coefficients of P(z) are rational numbers, ap, a1, ..-, An € Q. If 


P(z) has the root a+4/@, where a, 8 €Q and VB EQ, then a—/P is also a root of P(z). 


ProoFr. Assume that a+ /@ is a root, where a, 8 € Q and /3 ¢Q. Then 


P(at+ 8) ag (a vB)" + ay (a VB) be tant (a+ VB) +an 


j=0 
= wfartna™ V+ (9 )atate baton 


= Q1(a.8) + VB Qo(a, 8) = 0. 
Since both Qi(a, 3), Q2(a, 8) € Q and V8 ¢ Q, we must have Qi(a, 3) = 0 and Qo(a, 3) = 0. 


Similarly, 


P(a— 8) a a (a VB)" +a: (a- VB) 


II 
8 
oO 
a 
NY 
Q 
3 
ob. 
a 
an 
SS" 
a. 
D 
Q&S. 
is 
bo 
+ 
fra) 
a 
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°o i 
7 
3 
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The terms in which \/@ does not occur must correspond to even j, in which case (—1)/ = 1. Summing 
up, we get the same Q(a, 3) as above. 


The terms in which JB does occur as an extra factor must correspond to odd j, in which case 
(—1)? = —1. Summing up, these terms add up to —\/3- Qo(a, 3), where Qo(a, 3) is given as above. 
Since Q1(a, 3) and Q2(a, 3) are the same in the two cases, we get 


P (a— VB) = Qi(a,8) — VB Qala, 8) =0. 
It follows that 


(2- (a+ VB)) (2- (a- VB)) = (2-0)? - B= 2 - 207+ 07-6 


of rational coefficients is a divisor in P(z), 


P(z) = {27 —2az +a? — B} Q(z), 


where Q(z) has rational coefficients. 


3.2 Descartes’s theorem. 


We have previously from time to time used that if a polynomial 
P(x) = aga” + aya" +--+ + an_12 + a0, zeER, 


has only positive coefficients, aj > 0 for all j, then P(z) cannot have positive roots, and if the 
coefficients are alternating, a; = (—1)’ |a;| (or aj = (—1)4*" |a;|) for all j, then P(x) cannot have 
negative roots. 


We shall in this section derive some improved results concerning where the real roots are lying on the 
real axis. We assume that the real polynomial is normalized, i.e. that ap = 1. 


Theorem 3.2.1 If P(x) is of even degree, then P(x) has an even number of real roots (including the 
possibility of no root at all). 

If P(x) is of odd degree, then P(x) has an odd number of real roots. All roots are here counted 
according to their multiplicities. 


Proor. The theorem follows from Theorem 3.1.1, which states that for real polynomials, non-real 
roots are always given in complex conjugated pairs, so the number of non-real roots is an even number. 
The fundamental theorem of algebra states that the degree of a polynomial is equal to its number of 
roots, counted by their multiplicity, so the theorem follows by parity. 


Theorem 3.2.2 Given the real normalized polynomial 
P(g) =a" + aya” + +--+ + ay_10 + Gn, Q1,-.-,4, ER. 
Define constants H and L by 
—H :=min{0,q1,...,a,} and — L=min {0,—a,+a2,...,(—1)"an}. 


If P(x) has a real root, then it must lie in the interval [—1— L,1+ H]. 
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ProoF. If >1+ H > 1, then we get the estimates 


oil) 
P(x) > 2” —H{a14q"%4...41}=2"- HX 


x-1 


(a-la"-H-e® +H H-a®-H-x +H H 
> = 
x—-1 x—1l xa-l 


from which follows that P(x) has no real root > 1+ H. 
A similar estimate shows that the polynomial 
(—1)"P(—2) = 2” — aya" " + aga”? —--- + (-1)"an 


does not have any root —x > 1+ L. Thus we conclude that P(x) does not have any root x < —1—L, 
and the theorem is proved. 


If a, # 0, then 0 is not a root, and there is a neighbourhood of 0 which does not contain any root 
from P(x). 
Corollary 3.2.1 Given the real normalized polynomial 

P(a) =a" + aye) ++) + ap_it + Gn, Q1,.--,4y, € R anda, £0. 


Define N and M by 


-N = min {o,2,,.., 22h and = M = min {0,( rane ae coat oo 
a 


an an an An 


If P(x) has a real root, then it must lie in one of the two intervals 
1 d 1 4 
—oo, -———_ n —— ; 
ee es i. +N" 


1 
Proor. Put « = —. Then 
y 


1 
v"P(;) = any" agay be ay += a {0 | ee ee i}. 


It follows from Theorem 3.2.2 that any real root y, if it exists, must lie in [—1 — M,0[U]0,1+ N}. 


1 
Since y = —, we conclude that any real root must lie in the union 
x 


1 1 
| me ale fee poo]. 
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Example 3.2.1 Let 

P(2) =a? — 47" + Gar"? + 202° — 27 — 227 — 4, 
Then 

—H = min{0, —4,6, 20, —2, —2, 4} = —4, hence H = 4, 
and 


—L = min{0, —4, —6,20,-2,2,-4}=-6, hence L =6, 


because @19-13 = ag = —4, Gi9—10 = G9 = 6, Gi9-5 = G14 = 20, A193 = Gig = —2, A192 = G17 = —2 
and aig = —4. If a is a real root, then a € [—1 — L,1+ H] = [—7, 5]. 
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Figure 9: The graph of P(x) = x19 — 4x13 + 6x19 + 202° — 2x3 — 2x? — 4. 


Furthermore, 


sO 


=M = min {0 so 1,7} = 1, thus M = 1, 


. mos 1 1 
so possible real roots must also lie in |—oo, 5 U 6? Foo}. 


Combining these results we see that possible real roots are limited to 


1 1 
—7,-=| U]=,5]. 
Ea] ef 
1 
It follows from Figure 9 that there is only one real root and it lies in the interval [0.7,0.8] C E 5 9 


Theorem 3.2.3 Descartes’s theorem (1637). Let 
P(g) =a" + aya) +--+ @n_10 + Gn; Q1,..-,4n ER, 
be a real normalized polynomial. 


e The number of positive roots of P(x), counted by multiplicity, is at most equal to the number of 
changes in sign in the sequence 1, aj, ..., Qn- 


e The difference between the two numbers is an even number. 


In the count of changes of sign we only consider coefficients aj #0 which are not zero. 
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ProoF. If P(0) = 0, then the root « = 0 has some multiplicity k € N. Since 0 is not positive, we can 
divide by x* to get P;(x) = P(x)-x~* for x > 0. Then by continuity, P,(0) 4 0, and P,(x) and P(z) 
must have the same positive roots, all of the same multiplicity. 


We may therefore in the following assume that an, = P(0) 4 0. 


Theorem 3.2.3 clearly follows, if we can prove 


Lemma 3.2.1 Let P(x) be a normalized polynomial of real coefficients, where P(0) #0. If P(x) has 
r positive roots, then there are r+ 2p changes of sign in the sequence 1, a1, ..., Gn, where p is some 
nonnegative integer, p € No. 


ProoF. Induction after r. 


1) 


If r = 0, then P(z) has no positive roots. Then P(x) has constant sign for « > 0. Since P(x) — +00 
for z + +00, this sign must be +, and we conclude that a, = P(0) > 0. 

The sequence 1, a), ..., @ then starts and ends with positive terms. If a, is the first negative 
term, then we search for the first following positive term az, € > k. It follows that we have two 
changes in the subsequence 1, ..., ae. 

Proceed in the same way with the subsequence ag, ..., G@,, where both ag and a, are positive, etc.. 
After a finite number of steps, each adding 2 to the count of changes of sign, we finally reach an, 
and the claim follows for r = 0. 


Assume that the lemma holds for some rp € No. We shall prove that it also holds for its successor 
r=rotl. 


We assume that the polynomial P(«) has 79 + 1 positive roots, and we choose one of them, a > 0. 
Then 


(27) P(z) =a" 4+ ae" 1+-->+a, =(2—a)Py(z), 
where 
Py(x) = a1 + byt +e + Oye 
must have ro positive roots, because we by division have removed one positive root a from P(x). 


Using the assumption of induction above, Po(x) has ro + 2po for some po € No changes of sign in 
its sequence of coefficients. 


Then consider more closely the sequence of coefficients 1, bj, ..., bn_1 of Po(a). Let 6), be the 
first negative of these, then b), the first positive of them after b),, etc., up to by which 


rot2po’ 
represents the last change of sign. 


We have schematically, 


1 hey Diag, Ag Dag, eg ey Thea 
~+ >0 = <0 4 & (—1)"9 
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Then notice that the coefficients a9 = 1, aj, . 
1, by, ..- 


ao = 1, 


, bn—-1 of Po(x) from (27) by the equations 


a, =b;—ab), ag=bz,—ab, -:-, 


an 1 = by, 1 — aby, 25 


.., Qn of P(x) are found from the coefficients bp = 


Qn = —aby_1. 


In particular, 


-inanci BI 


ao = 1, positive, 
ay, = by, — aby, -1, negative, 
ay, = by, — aby, -1, positive, 


ay = by — aby 


ro+2po ro+2po ro+2pq—1? 


an = —abn—1, 
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In the last equation we use that b,_; #4 0 must have the same sign as OK, ee 
Then consider the sequence 1, a1, ..., Gn of coefficients. According to the analysis above there 
must be an odd number of changes of sign between 1 and a),, an odd number of changes of sign 
between a), and a),, etc., until we obtain an odd number of changes of sign between a), +2p, and 
Gy. In the total count we therefore get that the number of changes of sign is ro + 2p9 + 1 plus an 
even number > 0. This is precisely rp + 1+ 2p for some p € No. 


Lemma 3.2.1 now follows by induction, and then Theorem 3.2.3 is trivial. 


Example 3.2.2 1) We have trivially one change of sign in P(x) = x — 1, so we must have a positive 
root. It is of course x = 1. 


2) The polynomial P(x) = x? — x +1 has two changes of sign, so we either have none or two positive 


roots. Since the discriminant is negative, we must have none real root. 


3) The polynomial P(x) = a4 — 7x? + 6a — 1 has no rational root. Its sequence of coefficients has 
three changes of sign, so the polynomial has either one or three positive roots. 
The polynomial P(—x) = x4 — 7x? — 62 — 1 has one change of sign, so P(x) has one negative root, 
cf. also Figure 10. 


Figure 10: The graph of P(x) = «4 — 7x? + 6x — 1. 


4) The polynomial 


P(x) = 2? — 4a) + 62'° + 202° — 223 — 22? — 4, 


also considered in Example 3.2.1, cf. Figure 9, page 49, has three changes of sign in its sequence 
of coefficients, so we have either one or three positive roots. 
The polynomial 


P(—a) — ao"? + 4¢* + 6a” — 202° + 247 — 227 — 4 


has four changes of sign in its sequence of coefficients, so P(x) has either none, two or four negative 
roots. > 
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Example 3.2.3 Consider the polynomial 
P(x) = 2” + ayn"! +--+ + an_12 + Gn, Q1,---,A, €R, a, £0, 


of degree > 3. Assume that two successive coefficients are 0, thus a; = a;+1 = 0 for some 
i€{1,...,n—2}. The sequence of coefficients for P(x) is then 


1, Q1, Q2,---, Ai-1, 0, 0, Qj42, +++, An, 
(at most n — 1 numbers # 0), and the sequence of coefficients for (—1)”P(—2) is 
1, —ay, a2, -.., (—1)*+, 0, 0, (—1)* "agra, ..., (—1)" an. 


We get an estimate of the number of positive roots of P(x) from the former sequence, and an estimate 
of the number of negative roots of P(x) from the latter sequence. Since a, 4 0, it follows that 0 is 
not a root. 


Choose j, such that a; and aj;4, #0, and then combine the two sequences above, 
a5, Qj+415 
(-1)/a;, (-1)?¥a541. 
It follows that we have just one (horizontal) change of sign in this group of coefficients. 


If one of a; or a;+1 is zero, we of course have no such change. Thus, combining the two sequences, 
where we in each horizontal sequence have at most n — 1 coefficients 4 0, we conclude that we can at 
most have (n — 1) — 1 = n — 2 changes of signs in total in the two sequences. Hence, the polynomial 
P(x) has at most n — 2 real roots, and thus at least one pair of complex conjugated roots. 


For n = 3 we have only the possibility of 2? + a3, where the roots are found by solving a binomial 
equation, cf. Section 2.1, hence the only real root is /—a3. 


For n = 4 we have the two possibilities 


eae? +a, and e* age + au: 
Notice that e.g. 24 —2x?+1 can be considered as a polynomial of degree 2 in the new variable z = x?, 
and that we have two changes of sign. However, it has the four real roots 1, 1, —1, —1, so both 1 and 
—1 are double roots. It is therefore in the result above essential that we assume that two successive 
coefficients are 0. Otherwise the conclusion may be wrong. 


3.3 Fourier-Budan’s theorem. 


Given a real normalized polynomial 
P(a) = 2" + aye) +++ + apie + Gn; Q1,.-., An ER, 
and put « = #9 +t for any real x9. Then by a Taylor expansion, 


P' (20) 


1! 


n! 


P (ao + t) = P (xo) 4 


? 
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where always P(”) (9) = n!. It follows from Theorem 3.2.3, Descartes’s theorem, that the number of 
roots of P(a) in the interval ]a9,-+oo[ (counted by multiplicity) at most is equal to the number V (1) 
of changes of sign in the sequence 


P (xo), P’ (ao), ..., P™ (ao) = nl, 
and that the difference between the two numbers is even. 


Given any half open interval Ja, (], closed to the right, the number of roots of P(x) in Ja, 3] (counted 
by their multiplicities) must for some nonnegative integers A and B be 


{V(a) — 24} — {V(8) - 2B} = V(a) — V(8) — 2(A- B), 


so this number is V(a) — V(Z), modulo some even number. We shall prove that we always have 
A> B. 


Theorem 3.3.1 Fourier-Budan’s theorem. (Mentioned at lectures in 1797; published in 1820.) Let 
P(x) be a real normalized polynomial of degree n. 


1) The number V(x) of changes of sign in the sequence 
Pla), PXa) 5 P(g) 
is a monotone decreasing function, which is half continuous from the right. 


2) For every interval ja, 3] the number of roots in this interval is at most V(a) — V(@), and the 
difference between the two numbers is even. 


3) If ax is negative and numerically large, then V(x) = n, and if x is positive and large, then V(x) = 0. 


Proor. Let 21,...,2y be all real numbers, which are roots in at least one of the polynomials 
P(x), P'(x), ..., P(™ (a), and let 21, ..., ey be increasingly ordered. Then each of the polynomials 
P(x), P'(x),..., P‘"-))(x) must have constant sign in each of the intervals 

]—oo, x1[, ]z1, 22[, sey Jznw-1,2N[, zn, +00, 


which is supplemented with the trivial P («) = n! > 0 for every 2 € R. Thus, V(a) must be constant 
in each of these intervals. 


Since clearly, 
P(x) = +00, P'(x) =? +00, sey PD (x) —> +00 for ZL- +00, 


we conclude that P(x), P’(x), ..., P(x) must all be positive in ]xy, +0o[, so V(x) = 0 in Jay, too]. 


Then notice that 
(—1)"P(a) > +00, (-1)""1P"(a) + +00, ..., —P@ (2) 4+00 2-00, 


which implies that P(«), P’(x), ..., P‘™ (a) are successively positive and negative in |—oo, «1[, so we 
conclude that V(x) = n in ]—oo, x} |. 


Theorem 3.3.1 follows, if we can prove 
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Lemma 3.3.1 Given anyi=1,...,.N. Then V (xi) = V(a) for alla € |a;,vi41[, and V (a) < V(a) 
for all x € |ai-1,2;[. In the latter case the difference is an even number, provided that x; is not a 
root of P(x). If on the other hand x; is a root of P(x), then this difference is instead equal to the 
multiplicity of the root x; in P(x) plus some even nonnegative number. 


PRooF. Consider the sequence P(x), P’(x),..., P(x) at the point x; and in the two adjacent 
intervals. 
If x; is a root of multiplicity m in P(x), then the first m numbers of P (2;), P’(x;), ..., P™ (a;) 


must all be zero, and then P‘”) (a;) 4 0. Its value can be positive as indicated in Table 2, or negative 
as indicated in the alternative (Alt.) of Table 2. 
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ak oe Alt. Alt Alt 
(altern. | 2; | (const. 
: Z Hea a; u;> 2 
signs) sign) 
P(x) Sh 0 T sk 0 —_ 
Peg) - 0 + + 0 - 
PO") (x) + + + - - - 
Alternatives 
Pt) (x) hee], a : ae) Weel eee alll : 
POTD g) 2 0 + +] 0 | - +] 0 | - +| 0} + 
PlE-) (a) - 0 + +] 0 | - +] 0) - - | OF} + 
PO(c) & |oe| cee eas! Peeled fell WBE er leats 
P©)(z) zi 4 a 
Table 2: Possible variations of signs in the neighbourhood of x; in the proof of Fourier-Budan’s 


theorem. 


We shall take into account the possibility that there later on may be zeros in the sequence 

P (a), P! (a), ..., P™ (x;). One such example is given in Table 2 for the sequence 

PF+)) (aj), ..., P“1 (x;), where P“) (a;) and P™ (2;) are chosen positive in the main case, while 
the possible alternatives are -, -, or +, -, or -, +. 


Whenever P“) (x;) 4 0, the function P“) must necessarily have the same sign in the two adjacent 
intervals. 


Thus, if both PY (x;) and P9+ (x;) are # 0, then the pair (P(x), PO+)(x)), will contribute with 
the same number (either 0, or 1) in the two adjacent intervals as at the point x; itself. 


Assume that 2; is a root of P(x). Then by Taylor’s formula, 


P(x) — pam (zi) (w— aj) +e-+, thus (2) wa (2) 
m! (x — aj) m! 
PO) (x, plm-2) PO) (x, 
PUP-ig) = (2%) (w — 24)? +++, thus (2) 
2! (x — 2) 2! 
PO) (x; Pplm-l) P) (x; 
PO-Di(z) ee) (w-—ai)te--, thus (2) ie ) 
1! LX; 1! 
for 2 — 2;. It follows in this case that P(x), ..., P°"~))(x) in the interval to the right of 2; must 


have the same sign as P’™) (2;), while P°”~") (x) in the interval to the left of x; has the opposite sign 


EO 


IF 


Download free eBooks at bookboon.com 


Methods for finding (Real or Complex) Zeros in PolynomialPosition of roots of polynomials in the complex plane 


x —oo | —2); -1] 0] 1 | 2 | +00 
P(2) = 0° —a*-—303 +22 +5] - - 


P' (x) = 5a* — 4a? — 92? +2 - 

P" (ax) = 20x? — 1227 — 182 - - - | 0} - 

P®)(x) = 60x? — 242 — 18 7 

P(x) = 1202 — 24 : Pani on 

P() (x) = 120 

V (a) 5 | Ss | 4 (22 ial a 


Table 3: Table of P(x), ..., P©)(x) and V(z) for x = —oo, —2, —1, 0, 1, 2, +oo in Example 3.3.1 


of P!(™ (a;), and P’"~?) (x;) has the same sign as P™ (2;), etc.. Hence, there is no change of sign 
in the subsequence P(x), ..., P(a) at x; as well as in the interval to the right of x;, and there are 
m changes of sign in the interval to the left of 2;. 


Concerning the possible subsequence P“) (x), ..., P((a) as in Table 2, a similar argument shows 
that P@ +) (a), ..., PY) (x) have the same sign of P (a;) in the interval to the right of x;, while 
in the interval to the left of x; the signs are alternating. Hence, there are just as many changes of 
sign in the interval to the right of x; of the subsequence P“) (x), ..., P(x) as at a; itself, namely 
none, if the combination of signs of P“*) (a;) and P (2;) is either +, +, or -, -, and it is 1, if this 
combination of signs is either +, -, or -, +. 


In the interval to the left of x; the number of changes of sign is even, if the combination of signs is 
either +, +, or -, -, in fact = the largest even number < ¢—k, and odd if the combination of signs is 
either +, -, or -, +, in fact = the largest odd number < @— k. 


It follows from the discussion above that we have the same number of changes of sign in the interval 
to the right of x;, and when we look at the interval immediately to the left of x; we have found a loss 
of m changes of sign, when 2; is a root of P(x) of multiplicity m, and an even number of changes 
of sign of each subsequence of the form P“)(x), ..., P/(a), and Lemma 3.3.1 is proved, hence as a 
consequence also Theorem 3.3.1. 


Example 3.3.1 Given the polynomial P(x) = 2° — 4* — 323 + 2x +5, we obtain Table 3 of the signs 
of P(x), P’(x),..., P®(a), so we can compute the value of V(x) for various values of x. 


The columns of —oco and +oo correspond to large negative and large positive x. It follows that all 
real roots of P(z) must lie in the interval | — 2,2], and that we have one root in ] — 2, —1[, none or two 
roots in ] — 1,0], no root in 0,1], and none or two roots in ]1, 2]. 


It follows from Figure 11 that we have one real root in ]— 2, —1], no root in ] — 1, 1] and two real roots 
in ]1,2]. The remaining two roots must be complex conjugated.> 


Example 3.3.2 Consider the polynomial P(x) = x4 — 7x? + 6x — 1 of Example 3.2.2. We choose 
x; = —4, —3,..., 2, 3, and then set up Table 4. 


We conclude from this table that we have one root in the interval | —4, —3], another one in the interval 
]2, 3], and none or two real roots in the interval ]0, 1], and none in | — 3,0]. 4 
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i) 
> 


thods for finding 


Figure 11: The graph of P(x) = 2° — x4 — 3x23 + 22 +5 of Example 3.3.1. 
x lo) 4 3 2 1/0} 1) 2 | 3) +00 
P(x) = «4 —7x*+6zr-1)| + + - - - -|-]-]+]) 4+ 
P'(x) = 423 — 142 + 6 - - - + | + - +) + 
PY") = 124? — 14 . = ||, se 
P®)(¢) = 24x “ : . 4 - | 0 +| + 
Pega 24 
V(a) 4 4 3 3 3/3] 1 )1/)0 0 
Table 4: Table illustrating Fourier’s theorem in the case of Example 3.3.2. 


3.4 Sturm’s theorem. 


The problem of determining the number of real roots in a given interval was solved in 1829 by the 
French mathematician Sturm. We shall apply the Euclidean algorithm on P(#) and P’(x). We write 
in the present case the equations of division in the following way, 


P(x) = P'(x)Qi(x) — P2(2), 
P(x) = Ps(x)Qo2(x) — P3(2), 
P(x) = P3(x)Qs(x) — Pa(z), 
(28) 
Pala) = Pre) Qa) _ Pol), 
Pm—-1(t) = Pm(x)Qm(z) 


This is also written in a more traditional way as 


P(t) = P'(x)Qi(x) + {Pa(z)}, 

Pz) = {-Po(x)}-{—-Qe(x)} + {—Pa(z)}, 
—P,(z) = {—P3(x)}Qs3(x) + Pa(z), 
—P3(x) = Py(x)-{—-Qa(z)} + Ps(z), 

Pa(x) = Ps(x)Qs(xz) + {-Pe(z)}, 


etc.. Here we note that the first and the fifth equation have the same combination of signs, so these 
will be repeated cyclically of period 4. 


Thus, the changed form means that the quotients of the equations of division have been denoted 
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Qi(x), —Qe(x), Q3(a), —Qa(ax), ..., and the remainders are — P(x), —P3(x), Py(x), P5(a), .... This 
means that (P, P’) is the normalized polynomial which is associated with P,,(x). 


The polynomial P,,(z) is a divisor in all of the polynomials P(x), P’(a#), Po(x), ..., Pm(2). 


We define the so-called Sturm chain as the sequence of polynomials, 


HCA) = Be yye PHU) == By He = Gy Bimal) = “BEET, Hel) = FES = 
When the equations of (28) are divided by P,,(x), we clearly get 
A(z) = Hi(x)Qi(2) — A2(z), 
A(x) = Hp(x)Q2(x) — Hs(z) 
a(x) = H3(x)Qs3(x) — Ha(z) 
(29) 
Am-i(@) = Hm-1(2)Qm-1(2) — Hm (a) 
Hm—1(t) = Hm(£)Qm(z), 
Hy,(“) = 1, 


where all polynomials have real coefficients. 
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Assuming that P(x) is normalized, we get from the Fundamental theorem of algebra that 
P(z) = (w©—ay)"-+-(w@- a)", 


where aj, ..., @s are the mutually different (real or complex) roots of P(x), and 11, ..., vs are their 
multiplicities, so 


Wyte +s =N, 

and the largest common divisor, cf. Theorem 2.4.1, page 33, is 
(P,P!) = (@- a)" +-(@— a4)", 

where we conventionally put (a — a;)° =H. 

Since P,,(a) and (P, P’) have the same roots of the same multiplicities, we conclude that 
H(x) =a(a%—a4):-:(a@—as), a#0, 


thus the polynomials H(x) and P(a) have the same (different) roots. Only in the case of H(x) they 
are all simple. 


After these preparations we formulate 


Theorem 3.4.1 Sturm’s theorem. Let W(x) denote the number of changes of sign in the sequence 
H(z), A(z), aoe Hm (2). 


Then W(x) is a monotonically decreasing function in x. It is half continuous from the right. 
For every half open interval ja, |, the number of mutually different roots of P(x) in |a, 3] is equal to 
W (a) — W(@). 


Proor. Let 21, ..., xy denote all real numbers, which are roots in at least one of the polynomials 
H (x), Hi(x), ..., Hm_—i(x), where we assume that they form an increasing sequence. 

In each of the open intervals |—oo, 71[, Ja1,22[, ..., }un—1,¢N[, ]@n, +00, each of the polynomials 
H (x), Hi(x),..., Hm—i(x) must have constant sign. Furthermore, H,,(#) = 1 > 0 for all 2. Hence, 


W () is constant in each of the intervals mentioned above. 
The theorem follows, if we can prove the following 
Lemma 3.4.1 Let x1,..., un be given as above. Fore everyi=1,..., N, the value of W (a;) is 


equal to the value of W(x) in the adjacent interval to the right of xj. 
In the adjacent interval to the left of x; the value of W(x) is given by 


W (ai), if x; is not a root of P(x), 
W(a2) = 
W (ai) +1, if x; is a root of P(x). 
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I y y 


Main case Alternative case 
Lee | Ce | VS ez Dae | Ge | a ae 
H (2) - 0 + + 0 - 


H(t) | + |+] + - fe] - 


Main case Alternative case 


Ay-i(“) | + + a = = a 


Hy (2) + 1G + 


Table 5: Table of variations of sign at a zero x; and in the two adjacent intervals. 


ProoF. Consider the sequence H(x), Hi(x), ..., Hm(a) at x;, as well as in the two adjacent intervals 
of Vie 


Since H,, (2;) = 1 40, it follows from the equations (29) that two successive numbers of the sequence 
HA (a;), Hi (ai), ..., Hm (a) cannot be 0 simultaneously. 


Assume that x; is a root of P(x), thus also a root of H(x). Then the sequence must start with 
H (a;) =0, hence H; (x;) 4 0, and Hj (x;) is either positive as indicated in the main case of Table 5, 
or it is negative as indicated in the alternative case of Table 5. 


If 2; is not a root of P(x), then it is not a root of H(x) either. Hence, the sequence starts with 


It is possible that we later get H), (x;) = 0 in the sequence. Since we never can have two successive 
zeros in the sequence, both Hx 1 (a;) and Hy,+1 (x;) are # 0. Using the formula from (29), 


Ay—1 (ai) = Ae (2) Qe (ti) — Hegi (24) , and = Ay, (xi) = 0, 


we conclude that Hy_1 (a;) and Hy,+1 (x;) must have different signs. The main case in Table 5 has 
+, -, and the alternative case has -, +. 


For every given j, for which H;(a;) 4 0, it follows that H;(x) has the same sign in the two adja- 
cent intervals as at x;. Hence, for every j, for which both H; (z;) and H;,, (x;) are # 0, the pair 
(H; (x), H;+1()) will give the same contribution (either 0 or 1) to W(z) in the two adjacent intervals 
as at Xj. 


Assume then that x; is a root of P(x) of multiplicity m. Then, by Taylor’s formula, 


(m) Lj ee ; (m) Lj et 
P(e) =P) (@_9,) eae and Pa) = TS em) Hens, 


so the two polynomials P(x) and P’(x) must in some interval |x;,7;-+¢[ have the same sign (= 
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x —oo | —2| -1] 0] 1) 2 | +00 
P(a) = 0° — 27 — 307 + In +5 : z 
P'(x) = 52+ — 423 — 9x? +2 - 
P(x) © 342° + 92? — 402-127 | - - - | - | - 
P3(x) & 79x? — 574x + 827 S 4 
P4(x) & —3953x + 7578 A . 
Ps(x) = some negative const. - - - a) Zo) : 
W (a) zi 4 3°} 3]3]1 1 


Table 6: Table of W(x) for « = —2, —1, 0, 1, 2. 


same sign as P(™) (x;)). Hence, H(x) and Hi(a) must also have the same sign in ]x;,2; + [, and 
consequently in the whole adjacent interval to the right of x;. 


Since x; is a simple zero of H(x), the sign of H(x) in the adjacent interval to the left of x; must be 
the opposite one of the sign in the interval to the right of x;. Hence, the pair (H(x), Hi(x)) produces 
no change of sign at x; and in the adjacent interval to the right of x;, while we get one chance of sign 
in the adjacent interval to the left of «;. 


If Hj, (w;) = 0, then there is precisely one change of sign in the subsequence Hy_1(x), Hy(x), Hy+1 (x) 
at x; and in the the two adjacent intervals, no matter the sign of H;,(2). 


Summing up, we have in the adjacent interval to the right of x; the same number of changes of sign 
as at x;. When we consider the adjacent interval to the left of x;, we have found a loss of one change 
of sign, when we pass through «;, if x; is a root of P(a), and no chance in the number of changes of 
sign, when x; is not a root of P(x), and Lemma 3.4.1, hence also Theorem 3.4.1, are proved. 


Notice that if 2 is not a multiple root of P(x), ie. not a root in P,,(x), then W(x) is equal to 
the number of changes of sign in the sequence P(x), P’(x), P(x), ..., Pm(x), and we can avoid the 
division by P,,(«), if we are content with finding the number of roots in intervals of endpoints which 
are not multiple roots. 


Remark 3.4.1 It is usually very difficult and tedious to find the polynomials P2(x), ..., Pin(a) by 
the Euclidean algorithm, which is caused by the denominators of the coefficients. However, in the 
computation of W(x) we only count the changes of sign, so we may, if convenient, multiply every 
polynomial by a positive constant. We use the symbol * to indicate that some polynomial P,(x) has 
been multiplied by some positive constant. > 


Example 3.4.1 Consider the polynomial P(x) = x° — xt — 323 + 2a + 5 of Example 3.3.1, where we 


illustrated Fourier-Budan’s theorem. 


Leaving out the tedious computations we end up with Table 6. 
Since Ps(a) is a negative constant, we conclude that all roots of P(x) are simple. 


The columns corresponding to —oo and +o0 correspond to large positive and negative x, so the signs 
are determined by the terms of highest degree. 
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There are W(—oo) — W(+o0o) = 4—1 =38 real roots. One of these lies in the interval | — 2,—1[, and 
two of them in the interval |1,2[, cf. Figure 11. > 


3.5 Rouché’s theorem 


We shall prove some general theorems from Complex Functions Theory and then apply them to 
polynomials in order to get the information of how many roots (counted by their multiplicity) a 
polynomial has in a domain bounded by a simple closed curve. By varying this curve we may obtain 
information of where the roots are more precisely lying in the complex plane. 


We first prove the following general result. 
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Theorem 3.5.1 Let f : [a,b] > C\ {0} be a continuous function on a closed, bounded interval [a, b], 
where f(t) £0 for every t € [a,b]. Then f has a continuous argument function arg f(t). 


PROOF. a) First assume that there is an a € R, such that 
f@eC\{e=re™|reo} for every t € |a, b], 


i.e. the image of [a,b] by f does not cross the half line {z = e’ | r > 0}. 


f(b) 
—~ 


LA: 


Figure 12: The image f([a,6]) does not cross the half line {z = re’® | r > 0}. 


Choose the argument function of f(t), such that 
a <arg f(t)<a+2n. 

Define the logarithmic function Log, : C \ {z = re’ |r > 0} + C by 
Log, z = In |z| +7 Arg, z, where Arg, z €Ja,a+ 2q|. 


Then Log, is continuous, so the composed map Log, o f is again continuous, hence also the imaginary 
part 


arg f(t) := Arg, f(d). 


b) Then assume that no such a exists. We put 
m = inf{|f(t)| | t € [a, O}- 


Since f is continuous on the bounded, closed interval [a,b], and f : [a,b] + C \ {0}, we conclude from 
one of the main theorems for continuous functions that m > 0. It also follows from another one of 
the main theorems that f is uniformly continuous on [a,b]. Hence, corresponding to m > 0 there is a 
6 > 0, such that 


lf(s) — f@®|<m for all s, t € [a,b] for which |s — t| < 6. 


Choose division points a = to < t1 <--: < tn = 6, such that |t; — t;-1| < 6 for 7 =1,...,n. Then to 
each of the intervals [t;_1,t,] there is an a; € R, such that 


f(t) eC\{z=re% |r>0} for every ¢ € [t;_7,t,] . 
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f(b) 


Figure 13: The image f({a, b]) crosses every half line {z = re’* | r > 0}. 


we may e.g. choose a; = arg f (t;) + 7. 
It follows from a) above that there exists a continuous argument function arg, f(t) on [tj—1,¢3]. 
Furthermore, these argument functions can be chosen such that 
arg, f (t;) =argj41 f (t;) att € [f_a,2,) 11 Esta] =f}. 
Then the argument function arg f(t), defined by 


arg f(t) := arg; f(t) for £¢ [ty atl GS Lesh, 


is uniquely determined and continuous. 


Given one continuous argument function O(t) = arg f(t), any other continuous argument function is 
given by O(t) + 2pm for some p € Z. In fact, it is obvious that O(¢) + 2p7 is a continuous argument 
function, and if O(¢) and ©,(¢) both are continuous argument functions for f(t), then 01(t) — O(t) is 
continuous on [a,b]. Since arguments differ by 2pm for some p € Z, it follows from the continuity that 
p moust be constant in [a,b], and the claim is proved. 


It follows in particular from the above that the difference 

(30) arg f(b) — arg f(a) 

has the same value for every continuous argument function arg f(t). 
We call this difference (30) the argument variation of f along {a, }]. 


If in particular f(a) = f(b), i.e. the continuous curve z = f(t), t € [a, 6], is a closed curve which does 
not pass through 0, then the argument variation is an integer times 27, 


(31) arg f(b) — arg f(a) = 2na for some n € Z. 


The number n € Z of (31) is called the winding number around 0 of the function f : [a,b] — C \ {0}, 
or the closed curve f([a,}]) not passing through 0. 


The winding number is interpreted as the number of times the curve winds around 0, counted positive 
in the positive sense of the plane, and negative otherwise. We notice that counting negative loops 
may cancel some of the counting of positive loops. 
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Aa) = fib) 


Figure 14: The winding number of a closed curve not passing through 0. In the present example the 
winding number is 2. 


The geometrical interpretation above of the winding number is often very easy in practice. We shall 
later prove that the winding number is equal to the difference of the number of zeros and the number 
is equal to the difference of the number of zeros and the number of poles in a domain of an analytic 
function. For polynomials the number of poles is 0, so we get the number of roots, counted by their 
multiplicity. 


Before we can make the statement more precise we need another theorem. 


Theorem 3.5.2 Given two continuous and complex functions f : [a,b] > C \ {0} and g: [a,b] — C, 
for which 


If 
(32) |g@)I<If(@)| for every t € [a,b], 


then the two functions f and f +g have the same winding number around 0. 


PROOF. From the estimate 
f(t) + 9)| = |fO|—-|lg@|>0 for every t € [a, bf, 


follows that both f and f +4: [a,b] — C \ {0} are continuous and that neither of then is 0 in the 
interval [a, 6]. Furthermore, 


fla)= f(b) and = (f + g)(a) = (f + 9)(0), 
so the images f([a,b]) and (f + g)([a, 6]) are both closed curves, not passing through 0. We write 


— fa 4 0 ; 
se rat)=s0)- {14+ 89), te fad, 
( 


It follows from | =~] < 1 that 
Fal 


sao) ce 
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g(t) 


hence 1 + F(t) lies in the right half plane for every ¢ € [a,}]. In particular, the principal argument 


Arg {1 + ah is continuous for t € [a, 0]. 


t 
Choose any continuous argument function arg* f of f. Then arg* f(t) + Arg {1 + aa must be a 


f(t) 


continuous argument function arg(f +g) for f +g. Finally, since 


sett Fal AE Tat 


we conclude that 


arg(f + g)(b) — arg(f + g)(a) = arg* f(b) — arg* f(a) = 2nr, 


and the theorem is proved. 
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The importance of Theorem 3.5.2 lies in the fact that it allows us slightly to perturb closed curves 
without changing their winding numbers. 


An obvious extension of the definition of the winding number is the following: Let 2 C C be an open 
domain, and C a simple closed curve in 9. Let f :Q — C\ {0} be a continuous map. Then the image 
f(C) must be a closed curve in C \ {0}, and as such it has a winding number around 0. 


We shall now restrict ourselves to complex functions f(z) which are either polynomials, or fractions 
of polynomials. Then we introduce 


P 
Definition 3.5.1 Let f :Q— C be a fraction of two polynomials, f(z) = aS. Q(z) 4 0 for all 
Zz 
z€Q. We define the logarithmic derivative of f(z) by 
/ 
FO) for 2 € {2 €D| f(z) £0}. 
F(z) 
Piles wh cid P(z) , 
Hence, the logarithmic derivative of f(z) = Olz) is defined in the set 
z 


O* := {z €C| P(z) £0 and Q(z) £ Of. 


If in particular, f :Q— C(R_ U {0}) does not have real values < 0 for any z € Q, then the principal 
logarithm Log f(z) of f(z) is analytic in Q, and its derivative is 


£2) 
f(z)" 


This is the reason why we in general call the right hand side of (33) the logarithmic derivative of f(z), 
even when Log f(z) is not defined. 


(38) © Log f(2) = 


Theorem 3.5.3 The argument principle. Let f(z) = aE be a quotient of two polynomials P(z) 
and Q(z). Let C be a simple closed curve inQ = {z € C | P(z) £0 and Q(z) £ 0}, and let w be the 
bounded domain of boundary f(C). 

Let N = N(w) denote the number of zeros of the numerator P(z) inw, and R = R(w) the number of 
zeros of the denominator Q(z) in w, all counted according to their multiplicities. 

Then the difference N — R is equal to the winding number of the closed curve f(C) around 0 in the 
w-plane. 

We have more precisely, 


(34) oa f a dz = N(w) — R(w) = winding number of f(C) around wo = 0. 


f(z) 
F(z) 
in the w-plane. Assume that C' is given by the parametric description z(t), t € [a,b]. Then we define 
by g(t) = f(z(t)) a continuous complex function g : [a,b] — C \ {0} with a continuous argument 
function arg g. 


1 
PROOF. We first prove that = $c dz is the winding number of f(C) around the point wo = 0 
Ti 
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Use the same construction as in b) in the proof of Theorem 3.5.1 to conclude that to every subinterval 
[t;,tj;+1] there corresponds a curve C;, which is a subset of C. Then we get the computations, 


1 ffie, Rel | f'(z) ey es g(t) dl 
iil Fo dz 2 ni I dz Qo ni a tm) om [In |9(t)| + 4+ arg g(t)];? 
n—-1 n-1 


sai 2 nla tj41)|—In|g (t Nh 5g De Corea tj4a) — arg g (ty) } 


fe {In |g (én)| — In|g (to) |} + {arg g (tn) — arg g (to) } 


= = fin I9(b)| — in |9(a) |} + 5 ~ {arg 9(b ) — arg g(a)} 


= 0+ winding number of g((a, b]) = f(C) around wo = 0. 


Finally, 
(35) f(z) _ Q() Pz) - Q(z) - Q"(z)- Plz) _ P'2) _ A 
f(z) Pz) Q(z)? Plz) Q(z)’ 


so the claim follows, if only we can prove it for f(z) = P(z), a polynomial. 
Given a polynomial, P(z) = a-(z— 2)"'+--(z— Zp)"”. Then 


P'(z) ny Np 


Piz) 2-4“ | £— By 


sO 


1 a 
dae = 
Qrt (z) Dah st 


We shall without proof use the well-known fact that 


1 dz 1 if zo lies inside C, 
2nt Jo z— % 0 if z lies outside C. 
This gives 
1 Pp’ 
—— (2) dz = number of zeros of P(z) inside C. 
2ri Io Plz) 
Similarly, 
/ 
—— QO") dz = number of zeros of Q(z) inside C, 
2ni Io Q(z) 


so when we integrate (35) the claim follows by insertion. 


Combining Theorem 3.5.2 and Theorem 3.5.3, the argument principle, we easily get 
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Theorem 3.5.4 Rouché’s theorem for polynomials. Let P(z) and Q(z) be polynomials, and let C be 
a simple closed curve in C. If 


|P(z)| > |Q(z)| for every z € C, 
then P(z) and P(z) + Q(z) have the same number of zeros inside C. 
ProoFr. Theorem 3.5.2 tells us that P(z) and P(z)+Q(z) have the same winding number with respect 


to wo = 0 in the w-plane. Theorem 3.5.3 tells us for polynomials that the winding number is equal to 
the number of zeros. 


A simple consequence of Theorem 3.5.4 is another proof of 


Corollary 3.5.1 The fundamental theorem of algebra. Given a polynomial of degreen € N, 
P(z) = agz" 4ay2"") + +++ + an-12 + Gn, where ao # 0. 


Then P(z) has precisely n roots, counted by multiplicity. 


PrRooF. It follows from 


a a 
P(2) = 2" {ao + = + +=}, z#0, 
that there is an R, such that 
lag) 7” > |an| + |an—a| r++--+ laa] r7-* for r > R. 


Putting P,(z) = az” and P2(z) = an + an_12 +++: +a 12" we see that 
|Pi(z)| > | Pa(z)| for every z € C, for which |z| > R. 


It follows from Theorem 3.5.4 that P(z) = aoz" and P,(z) + P2(z) = P(z) have the same number of 
zeros inside every circle C, of radius r > R and the common centre 0. Then the claim follows, because 
P,(z) = aoz” trivially has an n-tuple zero at z = 0 and no other zero. 


Example 3.5.1 Rouché’s theorem does not preserve the multiplicity of a given zero. Given a complex 
constant a € C, where 0 < |a| < 1. Choose P(z) = z” and Q(z) = —a. If C is the unit circle |z| = 1, 
then 


|P(z)| =|2"| =1> lal = |Q(z)|__ for |z| = 1, 


so it follows from Rouché’s theorem that P(z) = 2” and P(z) + Q(z) = 2” —a have the same number 
(= n) of zeros in the open unit disc {z € C | |z| < 1}. However, P(z) has the zero z9 = 0 of multiplicity 
n, while P(z) + Q(z) has n simple roots, all lying on the circle of radius ~/|a| 4 0. © 
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Example 3.5.2 A typical application of Rouché’s theorem is the following. Given the polynomial 
2° + 22? — 50z + 100. 
It has three zeros by the fundamental theorem of algebra. 


These three roots all lie inside the circle of equation |z| = 9. In fact, choose P,(z) = z3 and Qi(z) = 
227 —50z +100. Then we have the following estimate for |z| = 9, 


|Pi(z)|=9% =729 and IQi(z)| < 2-97 +50-9+100 = 712, 
so |Q1(z)| < |P,(z)| for |z| = 9, and the claim follows from an application of Rouché’s theorem. 


There is only one root inside the circle |z| = 4. We again apply Rouché’s theorem. However, this 
time we choose Pj(z)=—50z and Qoe(z) = 2z3+227+100 and get the following estimates for 
Jz] =4, 


|Po(z)| =|—50z|=200 and |Q2(z)| = |z° + 22? + 100] < 64 + 32 + 100 = 196, 


so |P2(z)| > |Qe2(z)| for |z| = 4, and the claim follows, because the only root of P2(z) = —50z inside 
|z]} =4is z=0. 
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t 
Finally, there is no root lying inside |z| = Zz In this case we choose 
P3(z)=100 and = Q3(z) = 2° + 227 — 50z, 


7 
in which case we get the following estimate for |z| = D 


7 (7 )" 7 15 49 7 
= 3 2 | | ZS | 
i. 
2 17 7} -50 = 100 = |Pa(2)), 


and the claim follows, because the constant P3(z) = 100 4 0. 
The roots are numerically computed to be approximately 


—8.889 794 306, 2.658 473 477, 4.231 320 828. 


Figure 15: The roots of z° + 2z? — 50z +100 and Rouché’s theorem. 


It should be noted that if we change the sign of the term —50z, so we instead consider the polynomial 
23 + 22? + 50z +100, then we can without any changes repeat all the arguments above, so we have 


e three roots inside |z| = 9, 
e one root inside |z| = 4, 
e no root inside |z| = s 
It is, however, in this case possible to find the roots directly, since we have 
23 +227 + 502 + 100 = (z + 2) (27 +50), 


so the roots are —2, 5V/2i and —5V/2i. O 
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Example 3.5.3 Consider the polynomial 328” — z? + 1. The degree is 87, so by the fundamental 
theorem of algebra it has 87 roots. They all lie in the open unit disc. In fact, choosing 


P,(z) = 32°" and = Qi(z) = -z° +1, 
we get the following estimate for |z| = 1, 
|Q1(z)| = |-28 +1] <2 <3 = |328"| = |Pi(z)|. 


Since the dominating term P,(z) only has the 87-tuple root of zo = 0 lying inside |z| = 1, the claim 
follows. 


Choosing P2(z) = 1 and Qo(z) = 328” — 23, we get the estimate, using e.g. a pocket calculator, 
|Q2(z)| = |328” — 23| < 0.98 < 1 = |P2(z)| for |z| = 0.96, 


so there lies no zero in the slightly smaller disc |z| < 0.96. Hence, all 87 roots z; must lie in the narrow 
annulus given by 0.96 < |z;| < 1, so it is from a numerical point of view fairly crowded concerning the 
roots in this narrow annulus. For the time being it does not look too promising to find (numerically) 
these roots with any prescribed tolerated uncertainty. That it is possible (though we shall not later 
in this book do it for the given example), follows from an application of Graeffe’s squaring method, 
which will be described in Chapter 4. > 


3.6 Hurwitz polynomials 


In connection with the question of stability of mechanical or electrical systems concerning oscillations, 
it is of great importance to be able to decide whether a polynomial has all its roots lying an the open 
left hand side of the plane. Polynomials of this property of only having roots in the open left hand 
side of the plane are called Hurwitz polynomials. In this connection and aiming at proving stability 
it is, however, of less importance also to find approximate values of these roots. It suffices in most 
applications that they all have a negative real part. 


In order to be more precise concerning what is meant by a Hurwitz polynomial we shall start with the 
following considerations. Given the polynomial 


P(z) =apz" +a)27" "1 +--+ + agg + aq 


of complex coefficients. Then clearly the polynomial 


Po) = Gp 27 +0, 27 +e + Gpnl 2+ Gy = Ane* +012" 1 +--+ + Gn_izZ 4+ Gn, 


where we have taken the complex conjugated coefficients, must have the complex conjugated roots of 


those of P(z). Hence, the roots of the product P(z)P(z) are either real, or complex conjugated of the 
same multiplicity, and P(z)P(z) must have real coefficients. Now, complex conjugation maps the left 
(or right) half plane into itself, so without loss of generality we may in the remainder of this section 


restrict ourselves to only consider polynomials of real coefficients, ag, ..., Gn € R. 


Then we introduce the following more precise definition. 


Definition 3.6.1 We say that a polynomial P(z) of real coefficients is a Hurwitz polynomial, if all 
its zeros lie in the open left half plane Rz < 0. 
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It follows by the fundamental theorem of algebra that 


(36) P(z) = aoz"” +--+ + dn = a0 (2 — 1) ++ (2 — An), z€C, 


where some of the A; may be identical. If P(z) is a Hurwitz polynomial, and a+ iG, a <0, 6 40, is 
a root, then a — if is also a root. Thus, 


(z-a—i8)(z-at+if) =(z-a0)? + 8? = 2 — 2az + (a? + B?) 
must be a divisor of P(z). By assumption a < 0, so this divisor has only positive coefficients. 


If P(z) is a Hurwitz polynomial, and 4 is a real root, then A < 0, so the divisor z — » has trivially 
positive coefficients. 


Hence, if P(z) is a Hurwitz polynomial, then it can be factorized into factors of first or second degree, 
all of positive coefficients. Then it follows after a multiplication that P(z) itself must have positive 
coefficients, and we have proved 


Theorem 3.6.1 A necessary condition that a polynomial 
P(z) =agz" + aye” tee + ay 


of real coefficients is a Hurwitz polynomial, is that all its coefficients have the same sign, either all 
positive or all negative. 


For n = 1 and n = 2 the condition of Theorem 3.6.1 is also sufficient. This is no longer the case when 
n> 3. 


Example 3.6.1 The polynomial 


16z3 + 8274+9z4+17 


1 1 
satisfies the necessary condition of Theorem 3.6.1. Its roots are —1, — +7 and i, so two of them 


have positive real part, and the polynomial is not a Hurwitz polynomial, showing that the condition 
of Theorem 3.6.1 is not sufficient for polynomials of degree > 3. 0 


Theorem 3.6.2 A polynomial P(z) of real coefficients is a Hurwitz polynomial, if and only if 
|P(z)| > |P(—z)| for Rz> 0, 
P(iy) #0 fory ER. 
ProoF. Factorize P(z), 
P(z) = agz” + +++ + Gyn = G9 (2 — A) ++ (Z— An), zeéC. 
Assume that A; = a+i@, where RA; =a < 0. If we write z = x + iy, then 


Jz —Ajl? = (@ — a)? + (y — 8)? 
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If Rz =a > 0, then it follows from a < 0 that 
— 2 
Je— Ay? = (@- a)? + (yf)? > (-2 - a)? + (-y + 8)? =|-2 - HI 
If A; = a is real, i.e. 8 = 0, then it follows straightaway that 
z-al>|—z-al for Rz > 0. 
If instead A, is not real, ie. 8 A 0, then 
|(z — Aj) (2 — Az) | > |(—2 — Ay) (2 - 3) for Rz > 0. 
If therefore P(z) is a Hurwitz polynomial, then 
|P(z)| > |P(—z)| for Rz> 0. 


If « = 0, then |P(z)| = |P(—z)| = |P(éy)| = |P(—iy)| 4 0, because P(z) is a Hurwitz polynomial, so 
it has no zero on the imaginary axis. 


Conversely, if P(z) is a polynomial of real coefficients satisfying |P(z)| > |P(—z)| for Rz > 0, then in 
particular P(z) 4 0 for Rz > 0. Adding the condition that also P(iy) 4 0 for y € R proves that P(z) 
is a Hurwitz polynomial, and the theorem is proved. 
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Theorem 3.6.3 Schur’s criterion. The polynomial P(z) of real coefficients is a Hurwitz polynomial, 
if and only if all its coefficients are of the same sign, and the polynomial 


P(L)P(z) — P(-))P(-2) 


Q(2) = 4 


of lower degree is a Hurwitz polynomial. 


ProoFr. When P(z) is a Hurwitz polynomial we put 


Applying Theorem 3.6.2 on P(z) we get |P(z)| > |P(—z)| for Rz > 0, and |P(1)| > |P(—1)|, so 
|P(1) P(z)| > |P(-1)P(-z)| for Rz>0, 


and also for tz = 0, because then |P(z)| = |P(—z)| > 0 and P(1)| > |P(—1)|. Hence, R(z) is also a 
Hurwitz polynomial, because R(z) 4 0 for Rz > 0. 


Trivially, 


R(z) 


so —1 is a root, and z+ 1 is a divisor, i.e. 1 is a polynomial of lower degree, i.e. 
z 


R(z) _ P(1)P(z) — P(-1)P(-2) 


e Oe at z+1 


is a Hurwitz polynomial. 


We shall then prove that if P(z) is not a Hurwitz polynomial, then Q(z) given by (37) is not a Hurwitz 
polynomial, which will conclude the proof. 


Assume that P(z) has a root iyo on the imaginary axis. Then —7ypo is also a root of P(z), so 


P()P (iyo) = P(=1)P (~iy0) 


; = 0, 
iyo +1 


Q (iyo) = 


and iyo is a root of Q (iyo). This shows that Q(z) is not a Hurwitz polynomial in this case. 
Assume that P (zo) = 0, where tz > 0. Then 
(zo + 1) Q (20) = —P(-1)P (—20) , (—20 + 1) Q(—20) = P(1)P (—20). 


If also P (—zo) = 0, then it immediately follows that Q (zo) = 0, and Q(z) is not a Hurwitz polynomial 
in this case. 


Therefore, the remaining possibility is that P (zg) = 0, while P (—z9) #4 0. Then apply that ao, ..., dn 
all have the same sign, so 


P(-1)| = |(—1)"ao + (—1)"- 1a, +--+ an| < lao +a, +---+a,| = P(1)]. 
It follows that 


(zo + 1) Q (20)| = |P(—1)| - |P (—20)| < [PC]: 1P (-20)| = I(-20 + 1) Q(—20)|- 
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Now, 2 > 0, so it follows immediately from Theorem 3.6.2 that (z+ 1)Q(z) is not a Hurwitz 
polynomial, thus Q(z) is not a Hurwitz polynomial, and Schur’s criterion is proved. 


Since deg Q < deg P, Schur’s criterion may be applied at most n — 2 times before we can conclude 
whether P(z) is a Hurwitz polynomial or not. 


Example 3.6.2 Given the polynomial P(z) = z° + 22? + 3z +1 we compute by Schur’s criterion, 


{7 (2° + 227 + 32 +1) — (-1) (—2° + 22? — 324+ 1)} =627 + 102 +8. 


Q) = 


It follows that Q(z) is a polynomial of second degree of positive coefficients, so by the remark following 
Theorem 3.6.1, the polynomial Q(z), hence also P(z), is a Hurwitz polynomial. © 


Example 3.6.3 Given the polynomial P(z) = z° + 2z? + z +3 we compute by Schur’s criterion, 


1 
z+l1 


Q(z) = {7 (22 + 227 + 24+. 3) —3(—29 + 227 — 24 3)} = 102? — 224-12. 


Clearly, Q(z) is not a Hurwitz polynomial, so P(z) is not a Hurwitz polynomial either. > 
We mention without proof the following most commonly used criterion of Hurwitz polynomial. The 
proof is very long and tedious. 
Theorem 3.6.4 Hurwitz’s criterion (1895). Given a polynomial 
P(z) = 092" +a,2 1 + ++ + Gniz t+ a, 


of positive coefficients. Then P(z) is a Hurwitz polynomial, if and only if the following system of 
inequalities is fulfilled, 


a, ago 0 
a1 ao 
D, =a, >0, Dg = naar > 0, D3=)] a3 ag a, | >0, ‘ 
2 
x a5 a4 43 
ay ag 0 0 
a3 ag ay 0 
D, = : > 0, 
G2n-1 G2n-2 G42n-3 *** An 


where we have put a, =0 fork >n. 


Example 3.6.4 (Cf. also Example 3.6.2). We get for the polynomial P(z) = z3 + 227 +3z+1, 


=5>0, 


| 
on 
V 
S 
= 
| 
oOrn~ 
owrF, 
ENO 


D,=2>0, Da=| 


so P(z) is a Hurwitz polynomial, and all three roots lie in the left half plane. © 
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Example 3.6.5 (Cf. also Example 3.6.3). We get for the polynomial P(z) = z3 + 227 + 24+ 3, 
2 1 
Da=| 5 | [=-1<0 


Hence, there is at least one root satisfying #z > 0. Assuming that z = iy is purely imaginary we get 
P(iy) = (3-2y") +iy(1—y?) #0 for ally ER, 


so there is at least one root of positive real part. It cannot be real, so we have two complex conjugated 
roots of positive real part. The approximate values of the roots are 


—2.174559 41 and 0.087 279 7 + 1.171312 1%. ©) 
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4 Approximation methods 


4.1 Newton’s approximation formula 


This is usually derived from Banach’s fiz point theorem.. We shall not formulate and prove the latter 
theorem in its full generality in a complete metric space, but only consider the space of real numbers 
R, or the space of complex numbers C. 


Definition 4.1.1 We say that a map f : R-—R, or f : C —C, is a contraction, if there exists a 
d € [0,1[, the contraction factor, such that 
f(y) —fl@)| <Aly-—2| for all x, y ER (or € C). 


A contraction is clearly continuous, and if we put f°” = fo---o f, ie. a composition n times, then 
f°” is a contraction of contraction factor \”. In fact, 


Ifor(y) — fora)! = [(F2"-DQy)) - F (FY @))| < al POD) - foor-D(@) 


IA 


and the claim is proved. 


Definition 4.1.2 Let f: R-—R, or f: CC, be given. We say that xp € R(C) is a fix point of f, 
if f (xo) = 2o.- 


We can now prove Banach’s fix point theorem for R and C, and we notice that if we follow the same 
proof with obvious modifications, the theorem is proved in general for complete metric spaces. 


Theorem 4.1.1 Banach’s fix point theorem in R, or C. Every contraction on R or C has precisely 
one fix point xo. If x € R(or C) is any given point, then f°" (x) > xq for n > +00. 
Proor. Assume that f :R— R (of f : C > C) is a contraction of contraction factor A € [0, 1[. 
Uniqueness. Assume that x; and x2 are two fix points of f. Then 

0 < |x1 — x9| = |f (w1) — f (v2)| S Alar — ao], 


because f is a contraction. Since 0 < A < 1, this is only possible, if x; = x2, so the contraction has 
at most one fix point. 


Existence. Choose any fixed a € R (or C), and consider the sequence (f°”"(a)). Then for every n € N, 


FOP (a) — FFY (a)| = [f"(a) — F"(F (2) S Ale — F(2)L, 


because f°” is a contraction of contraction factor \”. 
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Then for any given n, k €N, it follows from the above that 


7?" (x) = foirt¥) (a) < fore Dg) = fort *) (7) 


Fom(a) — foo (a)| +--+ 


IA 


{arp AM be AMAT} Jar f(x) = a" fa Fa) 


rN” 
< 
~ 1-X 
From A € [0, 1[ follows that 


ja — f(x). 


A” | 
1—A 


x— f(x)| 0 for n — +00, 


hence (f°"(x)) is a Cauchy sequence in R (or C). Since R and C are complete, the Cauchy sequence 
(f°"(a)) is convergent with the limit xo, say, thus 


f(z) 420 ~=— for n > +00. 
Furthermore, since f is continuous, 
f (f°"(z)) = Fee) — f(x) for n — +oo. 


The limit of a convergent sequence in R (or C) is unique, so we conclude that 29, constructed in this 
way as the limit of the sequence (f°"(x)), is indeed a fix point. 


We are only considering polynomials, so there is no need to formulate Newton’s iteration method for 
real C? functions. If a polynomial Q(x) has multiple roots, then we know from Chapter 2 how we can 
find another polynomial P(x) of precisely the same roots, all of them, however, only of multiplicity 
1. Therefore, we can without loss of generality in the following assume that the polynomial P(x) has 
only simple roots, i.e. if P(x )) = 0, then P’ (ao) 4 0. 


Theorem 4.1.2 Newton’s iteration method for polynomials of only simple roots. Let P(x) be a 
polynomial of only simple roots, and assume that xo is a (real or complex) zero of P(x). If we choose 
x1 sufficiently close to xq and define 


_ Pits) 
P' (fas) 


ni = f (Sy) =n forneN, 


then Lp — Lo for n > +00. 


PROOF. Since wo is a simple zero, we must have P’(x) 4 0 in an open neighbourhood Q of zo. Hence 
the map 
P(x) 
P\(a)’ 


f(a) :=2 for « €Q, 


is of class C°(U), and even an analytic function in U, because P(x) is a polynomial. It follows that 


_P(x)P"(2) _ PP") 


Pc) {P(2)}? {P(e}? 
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In particular, f’ (wo) = 0, because P (ap) = 0. Since 


Xx _ 
(OO). yey 
vty 
for x and y in a small neighbourhood of zo, there exists to every A €]0,1[ an r, > 0, such that 
fe)-FMISAle—y| — for |x — ao], ly— 20] <r, 


and f is a local contraction on B (29,7). The theorem then follows from Banach’s fix point theorem. 


Although Newton’s iteration method usually is very efficient, there is, however, a drawback, because 
the contraction in the proof is local. If one of the elements 2, of the iterative sequence does not lie 
in B(ao,r,), then we cannot conclude anything about its successor. This could easily happen with a 
bad choice of x1, because we neither know zo (we are going to find x by this method) nor the radius 
ry. It is therefore in general a matter of a lucky choice of the starting point x1, if this method is going 
to be successful. 


Usually one only applies Newton’s iteration method in the case of a real polynomial, i.e. of real 
coefficients. 
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Theorem 4.1.3 Let P(x) be a polynomial of real coefficients of degree > 3, and assume that it has a 
simple real root xo lying in some interval ja, bf. 


1) If P'(x)- P"(x) > 0 in Ja, b[, then the Newton iteration method converges towards xo, if we choose 
the right end point x; = b as our starting point. 


2) If P'(a)-P" (x) <0 in ja, b[, then the Newton iteration method converges towards xo, if we choose 


the left end point x1 =a as our starting point. 


PROOF. We shall prove Theorem 4.1.3 by some simple graphical considerations, cf. Figures 16-19. 


1) Assume that P(x) is convex and increasing in a neighbourhood of zo, i.e. 
P'(x) > 0 and P” (x) > 0 in a neighbourhood of zo. 
If we choose the starting point x, to the right of the root xo, it follows readily from Figure 16 that 
> tg >%3>++: > Xo, In \, Xo, 
because a bounded decreasing sequence is convergent, and xg is the only possible limit point. 


24 


Figure 16: If P(x) is convex and increasing in a neighbourhood of xo, choose the starting point to the 
right of x and obtain a decreasing sequence converging towards zo. 


2) Assume that P(a) is convex and increasing in a neighbourhood of the root 2p, i.e. 
P'(x) <0 and P” (x) > 0 in a neighbourhood of zo. 
If we choose the starting point x, to the left of the root xo, it follows readily from Figure 17 that 
UT <%2<%3< +++ < Xo, Ln / Xo, 
because a bounded increasing sequence is convergent, and xg is the only possible limit point. 
3) Assume that P(«) is concave and increasing in a neighbourhood of Zo, i.e. 
P'(x) > 0 and P” (x) < 0 in a neighbourhood of zo. 
If we choose the starting point x, to the left of the root xo, it follows readily from Figure 18 that 
T1<%Q<%3< ++: < Xo, Ln / Xo, 


because a bounded increasing sequence is convergent, and x9 is the only possible limit point. 
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Figure 17: If P(x) is convex and decreasing in a neighbourhood of xo, choose the starting point to 
the left of zp and obtain an increasing sequence converging towards 29. 


Figure 18: If P(a) is concave and increasing in a neighbourhood of 20, choose the starting point to 
the left of zp and obtain an increasing sequence converging towards 2. 


4) Assume that P(x) is concave and decreasing in a neighbourhood of 2p, i.e. 
P'(x) <0 and P” (x) < 0 in a neighbourhood of zo. 
If we choose the starting point x1 to the right of the root xo, it follows readily from Figure 19 that 


L1>%2>%43>+::> 20, In \, Xo, 


because a bounded decreasing sequence is convergent, and 29 is the only possible limit point. 


Example 4.1.1 We consider the polynomial P(x) = x — 2x — 5 from Example 1.5.1 (1). Then we 
6 
know already that there is a real root > “ By insertion, 
P(Ql)=1-2-5=-6, P(2)=8-4-5=-1, P(3)=27-6-5=16, 


so the real root lies in the interval [2,3], which can also be seen from the graph. 
It follows from 


P'(z)=327-2 and P"(z)=62 


that P’(z) > 3-2? -2 = 10 and P’(x) > 6-2 = 12, hence P’(x) - P(x) > 0 in this interval, so 
we choose the right end point x, = 3 of the interval as our starting point. Notice that even if xo 
apparently lies very close to x = 2, the best strategy is to choose x, = 3. 


86 


Download free eBooks at bookboon.com 


Methods for finding (Real or Complex) Zeros in Polynomials Approximation methods 


24 


Figure 19: If P(x) is concave and decreasing in a neighbourhood of xo, choose the starting point to 
the right of x9 and obtain a decreasing sequence converging towards x9. 


x, 
Le : 7 
-s4 


Figure 20: The graph of P(x) = x3 — 2x — 5. 


The iteration map is 


f(a) P(x) 1 3@9-6r—-15 2 1 4%+4+15 
v)=2@ =2 : = x- 2 , 
P'(x) 3 3x2 — 2 3 3 302-2 
so the iteration formula becomes 
2 1 42,+15 


Tn4t1 = 3 un t 


4 302 —2° 

Using just a simple pocket calculator we get with 7; = 3 > 20, 
r= 3, Ly = 2.36, x3 = 2.12720, 
v4 = 2.09514, t5 = 2.09455, xg = 2.09455, 

so an approximate value of the real root is a, * 2.09455. 


Then using Vieti’s formule we see that the two complex conjugated roots a2, a3 must have the 
structure 1.047274 + iy, because a; + a2 + a3 = 0, and Ray = Raz. Furthermore, 


y+ a2 + a3 = (—1)3 - (—5) = 5 = 2.09455 {1.0472757 + y*} , 


from which 
2 5 


= — 1.047275? = 1.29036 
2.09455 ; 


y 
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so y = 1.13594. The three roots are therefore approximately 


ay = 2.09455, ag = 1.047275 + 7+ 1.13594, a3 = 1.047275 — 7+ 1.13594. ©) 


Example 4.1.2 In Example 1.5.1 (3) we considered the polynomial P(x) = x+ +12x? + 96x —12 and 
showed that it had two real and two complex conjugated roots. From 


P(0) =—-12 and P(1)=97 
follows that we have one real root in the interval ]0, 1[. 
From 

P(-3)=—111 and = P(—4) =52 


follows that we have another real root in |] — 4, —3[. 
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These rough results can also be obtained by considering the graph of Figure 21. 


Figure 21: The graph of P(x) = x+ + 12x? + 96x — 12. 


Then we compute 
P'(a) = 42° + 240 + 96 = 4 {2° + 62 + 24}, 
P!(x) = 1207 +24 = 12 {x7 +2}. 


Clearly, P’(a) > 0 for all a € R, and P’(x) > 0 for x €]0,1[. Hence, P’(x)- P(x) > 0 for x €]0,1|, 
so we choose x; = 1 as our starting point in this interval. 


Then, since P” (x) > 0 everywhere, P’(x) is increasing. It follows from 


P(—3) = 4{-27 — 18 + 24} = -84 <0 


that P’(x)-P”(x) <0 for « €] —4,—3[. Therefore, the starting point of the iteration in this interval 
is chosen as 7, = —4. 


The iteration map is given by 


fice P(x) _ ti +122? + 96x - 12 Sa 3. ag? 412-2 


so the iteration formula is given by 


3 3 2 +12r, —2 
Wnt 4m 9° 33 + 6a, +24 


Choosing x; = 1 we get successively, 


va i ; = — = ; : a = 0.21774, 
2449. 7 

£3 = : 0.21774 : ree - 2 a 7 Stipe 
ae 7 

= ; 0.12418 : — z 2 a ] pasate. 
2449. 7 

5 = ; - 0.12310 : , oe 7 2 - _aoste. 
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so an approximate value of this real root is a, = 0.12310. 


Then we turn to the root in the interval [—4,—3], where we should choose x, = —4 as our starting 
point. However, in order not to make errors in the computations, due to the minus signs, we instead 
introduce 2, = —y,. Then the iteration formula becomes 

3 _3 y2—12y,-2 3 3 12y, +2— y2 


Hotl 4 48 5° 94 Gy, —y2 4°" 2D 8 + Oy, + OA 


Choosing y; = 4 we get 


aBt 5. PR a3 + 2 = 3.70088, 
y3 = : - 3.79688 ; ae oa SS = 3.77668, 
ye ante OE trem: 
Ys = : - 3.77649 : a 2 TO = 3.77649, 
and we conclude that an approximate value of this real root is ag = —ys = —3.77649. 


To find the complex roots we apply Vieti’s formulze. The sum of the roots is a, + a2 + a3 + a4 = 0, 
Xe) 


a3 + a4 = 3.77649 — 0.012310 = 3.65339. 


Since these two roots are complex conjugated, 
Rag =Raxz= ; - 3.65339 = 1.826695, 

and 
a3 = 1.826695 + iy, a4 = 1.826695 — iy. 

The product of the roots is a1 - ag - a3 - a4 = (—1)*- (—12), thus 
—12 = 0.12310 - {-3.77649} - {1.826695? + y}, 


and 
9 > 12 
y~ + 1.826695° = 0.12310 3.77649 ~ 25.81278, 
from which 


y” = 25.81278 — 1.826695? = 22.47597, 
so y = 4.74088. 
Summing up, the four roots are approximately given by 


a3 


al = 0.12418, Q2 = —3.77649, « \ = 1.826695 + 7 - 4.74088. ©) 
4 
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Example 4.1.3 Cf. also Example 2.4.2. The polynomial P(x) = x* — 323 + 5a? + a — 4 has the real 
root x = 1, and we get by a division that 


P(x) =(x-1)Q(x), where Q(x) =29— 202743244. 


lon 


The possible rational roots of Q(x) are 2, +4, and it is easily seen that this set of possible roots 
can be reduced to —1, —2, —4. We finally a by insertion, 


Q(-l)=-2, Q(-2)=-18, — Q(-4) <9, 


so Q(x) does not have rational roots. 


%y 
/ = d a 
ae 
a 


Figure 22: The graph of P(x) = 2° — 22? + 3a +4. 


Since a? — 2agag = (—2)? -2-1-3=4-—6 = —2 <0, it follows from Theorem 1.4.1 that Q(z) must 
have non-real roots. These are complex conjugated in pairs, because the coefficients of the polynomial 
are real. Now, n = 3, so we can only have one such pair, and we have precisely one real root. It follows 
from the continuity and Q(—1) = —2 and Q(0) = 4 that this real root lies in the interval ] — 1, 0[, 
which can also be seen from Figure 22. 


By differentiation, 
Q' (x) = 3a? — 42+ 3 and Q'(x) > 0 for « <0, 
Q" (x) = 6a — 4 and Q"(x) <0 for x < 0. 


Since Q’ (x) -Q” (x) < 0, we choose the left end point 2; = —1 < xp = ay as our starting point of the 
iteration below. 


The iteration map is 


Q(z) a? — 22? + 3x+4 
fe) =2-Z i =2 ; 
Q' (x) 3x? — 4a +3 
where it is no help to further reduce the fraction, so the iteration formula becomes 
eae x? — 2x? + 32,+4 eee (5) 0 nF 43 (a, 4 
i 322 —4tn +3 "7 B(-a,) 44a) +3 


We get using x; = —l, 


(-1)3-2-3+4 || 2 


= 1 — —t 
i S444 10 
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0.83 +2-0.82+3-0.8—4 
aes ae = —0.77635 
* + 3.0.82 44-0843 


0.776353 + 2 - 0.776357 + 3 - 0.77635 — 4 
Bh NES 3 - 0.77635? + 4 - 0.77635 + 3 =e 


0.77605% + 2 - 0.77605? + 3 - 0.77605 — 4 
pete eneeye 3 0.77605? + 4 - 0.77605 + 3 ange 


The real root is approximately given by a; = —0.77605. 


Then use Vieti’s formulze and that Raz = Rag to get 


Q,+02+ 03 = —(—2) = 2, thus a2 + a3 = 2.77605, 


and therefore ag, a3 = 1.388025 + zy. Finally, 


ayaza3 = (—1)°4 = —4 = —0.77605 - {1.3880257 + y}, 


§ World class 
15 January research 
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from which 


4 
2___* __ _ 1 3ggp95? = 3.29769 = 1.79658 
Y= 0.77605 - 


and the three roots are approximately 


ay, = —0.77605, Q2 = 1.388025 + 7 - 1.79658, a3 = 1.388025 — 72- 1.79658. ©) 


Example 4.1.4 Finally, we shall consider the polynomial P(x) = 32°’ — 23 + 1 of Example 3.5.3. 
We know already that it has precisely one real root, and that Rouché’s theorem implied that all roots 


24 
lie in the annulus 35 < la] < 1. Clearly, 


P(x) = 32" —2?+1>0 for0O<a <1, 


so the real root must lie in the interval ] — 1, —0.96], cf. also Figure 23. 


al 


Figure 23: The graph of P(x) = 32°" — 23 +1. 


In order to avoid mistakes concerning the minus signs in the iteration process we put y = —2, x = —y, 
so we shall instead find y €]0.96, 1[, such that 


Q(y) = 3y*""-yF-1=0, y €]0.96,1[. 
By differentiation, 
Q'(y) = 4 {87% — y?} > 0 for y €]0.96, 1[, 


Q" (y) =6 {3741 y® — y} >0 for y €]0.96, 1[. 
Since Q’(y)Q”(y) > 0, Theorem 4.1.3 tells us to choose the right end point y; = 1 of the interval as 


our starting point for the iteration process. 


The iteration map is 
fopny- 28 ag 
Q’(y) 3187 9% —y?}’ 
hence the iteration formula becomes 


_ Sy, =o, = 
Yn+1 = Un 3 {87 ye = yey 
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Choosing y; = 1, cf. the above, we get 


3-1-1 1 


ie, = = 0.99612 
me 3{87 — 1} 3-86 


3 - 0.996128" — 0.996123 — 1 


= 0.99612 = 0.99530 

she 3 {87 - 0.996125% — 0.996122} 
3 - 0.9953087 — 0.995303 — 1 

= 0.99530 = 0.99527 

ue 5 {87 - 0.995308 — 0,995302} 
-0.9952787 — 0.995273 — 1 

ey mee et = 0.99527, 


3 {87 - 0.9952786 — 0.995277} 


so using this particular pocket calculator we conclude that the real root is approximately 

a, = —0.99527. Since the exponent, 87, is very large, one should double check this result, because we 
do not know the programs used in the pocket calculator. Figure 23 was created in MAPLE, so quite 
another program. Looking at Figure 23 the found approximate value above looks very plausible. © 


4.2  Graeffe’s root-squaring process 


The most well-known approximation method, when we shall find the roots of a polynomial, is of 
course Newton’s approximation theorem, which was treated in Section 4.1. There exists, however, 
another method, which is less known, and yet it is in some cases superior to Newton’s approximation, 
in particular when the derivative P’() is very small in a neighbourhood of a zero. This method is 
called Graeffe’s root-squaring process after the Swiss mathematician C. H. Graeffe (1799-1873), who 
published this method as early as in 1837. Although the method may seem troublesome at the first 
glance, it may still have some advantages, in particular when one has a computer - or just a pocket 
calculator - at hand. 


4.2.1 Analysis 


1) Description of the squaring process. Assume first that all the roots of the normalized real polyno- 
mial equation 


(38) P(z) =a" + aye" 1 + age”? +--+ + a,_1t +a, =0 


are all real and mutually distinct in absolute value, thus in particular all simple. We shall for 
convenience, which will become clear later, write them in the form 


—T1, —T2, —T3, -.-, —Tn; where |—r1| > |-re| >--- > |-ral, 
arranged in decreasing order of the modules from —r; to —rp. 


We rearrange (38) in such a way that all even powers are on one side of the equality sign, and all 
the odd powers on the other side of the equality sign, 


a” +agx"? +--+ =—{aja" + a3x" 3 +--+}, 
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and then square both sides, 


gen + Ce aa + i lll Tree ig Ee i + ao + gaa Bees 
4+2a0?"~? + Qaga??4 4-0. | +2a,a3x7"—4 + 2ayasz?"-® + --- 
+2aga4x?"—© + --- 7 +2aza5x2"—8 + 


Collecting all terms on the left we get after a reduction, 


gn {ai — 2a} gen {a3 — 2aya3 + 2a4} gen4 


(39) {a3 — 2aga4 + 2a,a5 — 2ag}x?"-° + --- =0. 


The idea is that every root —r; of (38) is also a root of (39), because if 
Plat = fan eal) =: 

then the process above is described by 
Proven (—r;) = —Pryaa (—r;) and on (-r;)° = Praa (-r;)° , 


so (39) is written 


Pron (Py) = Poaalhy) = A Pree) + Pass —1)} Pea) — Poa 9} 
P05) 4 Lee (07) = Pen = 0 


In the next step we let y := —a? in (39. Then notice that 
wd = (ayy) = (aye (ay, 
so by this substitution (39) becomes 


y'” — {a} — 2ag} y" 1 + {a3 — 2aya3 + 2as}y"? 
(40) {a3 — 2azaq + 2aya5 — 2ag}y"* +--- = 0. 


From y = —2? follows that the roots of (40) are 


2 . 
ywi=—{ony=-rj,  f=,....n, 


so (40) has the same degree as (38), and its roots are obtained from the roots of (38) by a squaring, 
followed by putting a minus sign in front of them. 


Repeating this process we find at step number & some polynomial equation 
(41) af + ay pre) + aopee 7 Hee + Gn—1, bk + An,~ = 0 
of the n real roots 

gk k 


k 
(42) —r, 15, cess —r 


The process above is the same also when some of the roots have the same modulus, including the 
case when we have pairs of conjugated complex roots. It is, however, easiest to describe under the 
assumption that all roots are real and of mutually different modules. We shall first analyze this 
simple case. 
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2) The case of only simple real roots of mutually different modules. Since |ri| > |re| > +--+ > |rn| by 
assumption, we obtain for “large” k that 


oF or Qk 
(43) [rif > [ral > > Ira! 
so the roots =r?" are very different in absolute value. This can be used to find approximate values 
of each root =r?" To see this we apply Vieti’s formule on the equation (41) of the n separated 
(simple) roots (42). We write for short m = 2*. Due to the minus signs in (42) we get 


et ere eee ee 
Gan Hr {rete treOb try {re tee tre btee, 
(44) O34 =O {rere pe bare (rere pe bp, 


mm 
nd 


m 


— yM,™m 
An,k = T1772 °° Ty 
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Applying (43) on (44) we see that we have approximately 


apart; 
a2,h = TTS", 
— MN mM mnIN 
(45) Q3,k =, To r3 ; 


— MN mI™M pM m 
An,gk =T17TQ7T3 °° Ths 


because all of the other finitely many terms of (44) are much smaller than the leading term in (45). 
It then is very easy to find r; from (45), because 


ry’ = 41k; ri] = are, 


46 : ie. where m = 2". 
ie) im Qj,k Gjk 
pir — —4 Ale, yee aes 
j P53 ’ 
aj—1,k Aj—1,k 
ae : : 
rm = OK _ Gn, k 
An—1,k lmrl = % ar ee 
n— ? 


Clearly, we cannot determine the sign of the roots by this method, but these can easily be found, 
either by a graphical consideration, or by the theory of the previous chapters, or by simply inserting 
+r, into the original polynomial equation (38) 


The analysis above was based on the assumption that |r1| > |r2| >--- > |rn|, and that the roots 
—T1, —T2,..., —Tn are all real. 


3) The case of precisely two roots of equal modulus. We shall now assume that all roots are real and 


that just two of them are equal in absolute value, say |r3| = |r4|, so r4 = +r3. We apply the same 
method as described above and finally arrive again at (41) and (42), where of course —r2° = —r?" 
for k € N, because then all the exponents are even. Recalling that 2* = m, formule (44) are then 
written 

ak =TT, 


— pNypmM 
a2k=T,71T., 


a a ee ee 
ag = OPP po = Pegg bo, 


— pMNpMp2MpM m 
An-Vk =TEPE TZ TS Ppa tet 


— pMy,mM,,2M,,.M m 
Onk =T1,%g1T3Z Tr tlh, 


97 


Download free eBooks at bookboon.com 


Methods for finding (Real or Complex) Zeros in Polynomials Approximation methods 


where the dots indicate numerically smaller terms. Hence, (45) is here replaced by 


— »m 
Q1,k — ry oy 

— pill,in 
Q2,k = Ty T5 3 


MN pM pat 
a3,k = 2rP'rg'rs’, 


A7 
ae a4, = ery re”, 


— pMpMp2MpM .., 
Qn-1,k = 1, TQ 7T3Z Th tT 


— pMpMp2MpM , eM 
An,k = 111273 7S 1 tie 


The only difference between (45) and (47), when |r3| = |r4| is that cz = 2rj’rt’r}", where we get an 
extra factor 2, because in the following the product r3™ = r3"ry is unchanged from the previous. 
Therefore, (46) is replaced by 


ae T1] = X/Q@1,k, 
Ty = 41,k; 
(2k 
ry = 2k ee ak 
Q1,k : 
a3,k a3,k 

ST Se |= ala : 
(48) 2a2,k >, thus 2,4 where m = 2", 

a5,k 

iy ? 


a5,k 
m ? 
Ts 7% 5) T5| = 7 
4,k G4,k 


Qn,k , 

m ? 

ei Ira = 9/—* 
n—1,k n Gn ca 


so the difference is that |r3| = |ra| = — We can for large m estimate the number of equal 
\/ 2a 


modulus, because 
=) 2m,,2m,,2mM | __ 1 2 
a3 .k+1 = {ri TT, 13 } = 3 {a3,x} , 


so we just compare two successive coefficients (with respect to &), if the lower number is squared 
or not, when we pass from k to k +1 for large k. We notice that since %/2 — 1 for m — +00, we 
may above instead use the simpler estimate 


Q3,k 
|r3| = |ra] © 7/——a2,4, 
tf 


because in the limit, k > +-oo, or m — +00, we shall get the right values of |r3| = |ral|. 


4) The case of precisely p roots of equal modulus. Similarly, if e.g. |rs3| = |ra| = |rs| (three roots of 
equal modulus), then we get for large m = 2", 
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and generally, if |rj| > |rj4i] = |rjta] =--- = |ry4pl > Iry4pgil, then 


Aj+1,k Aj+1,k 
(49) |rjsal = Iria] = +++ = Irytpl 7 gf for large m = 2", 
D°43,k 5k 


because %/p — 1 for m — +oo. 


We can for large m estimate the number p of roots of equal modulus, because for large k, 


1 


me 2 OF 1k 
Qj+iktl © —* Aj41 ks or Pen aes 


Dd 
we 


Qj+1,k+1 


s 


The case of precisely one pair of complex conjugated roots with different modules from all the other 
roots. We shall now consider the case where the given equation (38) contains one pair of simple 
complex conjugated roots. It suffices in the analysis only to consider a polynomial equation of 
fourth degree of the roots 


TI, —Tr2 a", —Tr2 eo —T3;, 
where we assume that |r| > |rg| > |r3|. Then the polynomial can also be written 


(a +11) (@+ roe) (e+ rae?) (2@ + rs) = 0. 


When we perform k root-squaring operations and put m = 2*, then the resultant equation has the 
roots 


so the corresponding polynomial can be written 
(z+r7") (z +r," en) (z+rz en) (z+ rz") =0, 
which we rearrange as 


oat ine + rq” emo 1 rs gant a ry} 23 


(50) + {rf'ry eS pi ec IMO 4 pM AM 4 pdm 1 nme eimO 1 pmpm ee) 2? 
+{ am MpMpM oimO | MpMAM o—imO 4 eet fpp2mpm — 9 
ryrs ry Ts Tye ry rg Ty € ry re betrp rs rs =0. 


We notice that the first terms in (50) are dominating in the coefficients of z° and z. Then we turn 
to the coefficient of z?, which is also written 


2rP rl? cosmO + rPrh + r3™ + Ir cosmO. 


If cosm© is approximately +1 or —1, then 2r{"r3" cosmO is numerically dominant. If instead 
cosm@ is approximately 0, either ri’r3 or r3™ become dominant. Therefore, if m increases, 
then the coefficient of z? continuously fluctuates in sign, which is very unlike the coefficients 
corresponding to real roots, which remain positive. Hence, we can identify complex conjugated 
roots by this fluctuations of the corresponding coefficient. Clearly, the restriction to a polynomial 
of fourth degree is of no importance. The observation above holds for general real polynomials. 
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Approximation methods 
Once we have identified a pair of simple complex conjugated roots, like in (50), the modules is 
found in the following way, where we first analyze (50), where r3* should be found. For large m 
we have approximately 


4 3 2 2 2 
e+ rP 2? +27 (11,172,713, 0) 2 trp rg 2 trp rg” ry’ = 0, 


from which we derive that a, , =r and ag, = rr? and ag, =rre™r®. We see that we shall 
1k 1 3,k 179 ; 172 73 
neglect the fluctuating coefficient a2,x (71,172,173, ©) and only consider 

rt” = a1,k, 


T1| = ®/@1,k, 
p2m = G3, T9| = 2m 03k 
(51) ; ak thus ak where m = 2". 
_ G4,k ace 
ts rn | = m/ 
3,k oe C3. 
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The general method is to neglect the fluctuating coefficient a;,, (71,...,7n,©) corresponding to a 
pair of simple complex conjugated roots, and then the modulus |r;| of the two complex roots is 
given by 


Then recall that we have assumed that there is only one pair of complex conjugated roots u + iv. 
In order to find wu and v we notice that it follows from Vieti’s formule after a rearrangement that 


was {a, + sum of all real roots in the polynomial} , 
so the real part is easy to find. Then 


Qj+1k . F 
w+uv=r2= m/tt* implies that v = ,/r2 — u?, 
j Oe VG 


so we have also found the imaginary part. Thus for a single pair of complex conjugated roots u+iv 
the method above should be straightforward. 


6) Several pairs of complex roots. If there is at least two pairs of complex conjugated roots, the 
description of the method becomes more complicated. First of all one identifies all modules |r;| 
and find all the real roots by the previous described method. 


The complex roots are then written in the form 


Be pat 2 py. 
Uj = 1V;, where uj + uj; =r}. 


Then we apply some of Vieti’s formulze, starting with a1 = a1,9, and then an—1 = @n—1,0 in case of 
two pairs, etc., and then find some equations of the unknown real parts u;. One should of course 
choose the least complicated of Vieti’s formule: in this process, but it must be admitted that if 
there are many pairs of complex roots, then we are forced to solve a very complicated system of 
non-linear equations in the u;. We shall later illustrate this by an example with two pairs. 


4.2.2 Template for Graeffe’s root-squaring process. 


Let 
Po(x) = &” + aya"! + aga? +--+ ane + Gn 
be a given normalized polynomial of real coefficients. 


1) We shall define the coefficients of the polynomial 
Py(y)=y" tay” + ayy 72 +---+al_sytdl, 


in the next step of the root-squaring process. This is described in (52), where we get the aj by 
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summing the j-th column. 


1 ay ag a3 a4 as 
2 2 2 2 2 2 
1 ay a5 a3 a4 as 


2 
(52) —2ag —2a,;a7 —2a2ag 
2ag 2a a9 
—2a109 
1 a a as ay as 
Then repeat (52) on the new coefficients (1, a),a@5,...,a/,), etc.. It cannot be told in advance how 


many times this should be done. A rule of thumb is that one should stop when each computed 
coefficient a;,41 (with the exceptions of the cases of multiple roots, or complex roots) is roughly 
the square of the preceding one aj,x, i.e. aj, 441 © fajn} for all coefficients which are not connected 
with multiple roots or complex conjugated roots. This happens when the corresponding double 
products are negligible compared with the square, so in practice it is easy to see when one should 
stop. This will be evident from the examples in Section 4.2.3. 


2) If all coefficients are positive, so all roots are real, and if furthermore they have mutually different 
modules, then find |r;|,.-.., |rn| by (46). Check the sign of the roots, i.e. check the possible 
solutions +r1,..., +r, in the original polynomial Po(z). 


3) If all coefficients are positive, but some of the coefficients are not eventually roughly squared, this 
is an indication of that we have roots of equal modulus. In this case, apply (48), or (49), whenever 
needed to find |ri|,... , |rn| and then check +r), ... , +r, in the original polynomial Po(z). 


4) If some of the coefficients fluctuate, this is an indication of a pair of complex conjugated roots. In 
this case, apply (51), or modifications of (51), to find |ri|,..., |rn|. For these pairs of complex 
conjugated roots, apply Vieti’s formule to find them explicitely. 


4.2.3. Examples. 


We shall give three examples, one with only real roots of mutually different modules, one with only 
real roots, but where some of the roots have a common modulus, and finally an example, where we 
have two pairs of complex conjugated roots. In two of the examples it is possible to find the roots 
directly, so we can compare the results. 


Example 4.2.1 We shall find all roots of the polynomial equation 


P(g) = a4 — 2? — 102? —2+1=0. 


We first notice that it is possible to solve this equation directly, because if x9 is a root, then xp 4 0, 


1 
and — is also a root. In fact, 
v 
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(a — 29) («-=) =a" {0 ah + 1, 


the polynomial must necessarily have the structure 


P(x) = wt —a23- 102? -x+1= (a? -ar+1) (x? — br +1) 
= a —(a+ bc? + (24+ ab)x* — (a+ dr +1. 


When we identify the coefficients we get 
a+b=1, and 2+ab=-10, ie. ab=-—-12, 


and a and b are the roots of the equation z? — (a +b)z + ab = 27 — z-12 = 0, thus 


ib =50svBT = 5047 ={ = ’ 
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and we have proved that P(«) can be factorized in the following way, 


P(x) = 24 — 2? - 102-2 +1 = (a? —42 +1) (2? +3241), 
from which we immediately get the roots 


e=2+ 22 -1=24+ V3 and z= 5{-3+ Ba} =5{-34v5l. 


Since we shall estimate the efficiency of Graeffe’s root-squaring method we notice for later use that 
the roots are approximately 


3.7321, 0.2679, —0.3820, —2.6180. 
It will be convenient also to give the modules of the roots, ordered according to their size, 
(53) |ri| = 3.7321, |r2| = 2.6180, \r3| = 0.3820, |r4| = 0.2679, 
because this is the order they should occur, when we apply the root-squaring method. 


We notice that the four modules are mutually distinct and that all four roots are real, so this should 
be the easiest case to handle. 


It follows from Table 7 that all coefficients are approximately squared, when we pass form one step to 
the next. We therefore conclude that all four roots are real, and that they are of mutually different 
modules, in particular they are all simple. 


Using k = 6 we get m = 2* = 64, and it follows from (46) that 
$4 = 4.0239 - 1036, thus |r1| = 3.7321, 


g4 _ 2-2650 - 108 


To = 4.0239 - 1036” thus T2| = 2.6180, 


64 4.0239 - 1036 


"3 = 99650 - 108° thus |r3| = 0.3820, 
64 1 
ee thus |r4| = 0.2680, 


~ 4.0239 - 1036’ 


in agreement with (53). 


Since r} +r2 +73 +74 = 1, we must have r; > 0 and rg < 0, and r; + rg = 1.1141 = 1 — 73 — 14, or 
r3 +r4 = —0.1141. This implies that rg < 0 and rg > 0, so we conclude that the roots are 


ry =3.7321, ro =—2.6180, r3=—0.3820, 74 =0.2680. © 


Example 4.2.2 Let us consider the normalized polynomial 


P(g) = a? + 22* — 5a? — 102" +4248 


of integer coefficients. Its possible rational roots are +1 
roots are 


, +2, +4,+8. A simple check shows that the 


r= 2, Lo = 243 = —2, tz=1 and t5=-l, 
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k ao ay, ag a3 a4 
1 ay ae ax | a4 
—2a2 —2a,a3 —2a2a4 
2a4 
0; 1 -—1 —10 —1 1 
1 1 100 1 1 
20 —2 20 
2 
1) 1 21 100 21 1 
1 441 10 000 441 1 
—200 —882 —200 
2 
2) 1 241 9 120 241 1 
1 5.8081 E4 8.3174 E7 5.8081 E4 1 
—1.8240 E4 1.1616 E5 —1.8240 E4 
* 
3] 1 3.9841 E4 8.3058 E7 3.9841 E4 1 
1 1.5873 EO | 6.8987 E15 1.5873 E9 1 
—1.6612 E8 | —3.1746 E9 —1.6612 E8 
4] 1 1.4212 E9 | 6.8987 E15 1.4212 E9 1 
1 2.0198 E18 | 4.7592 E31 2.0198 E18 1 
—1.3797 E16 * —1.3797 E16 
5} 1 2.0060 E18 | 4.7592 E31 2.0060 E18 1 
1 4.0240 E36 | 2.2650 E63 4.0240 E36 1 
—9.5184 E31 —9.5184 E31 
6] 1 4.0239 E36 | 2.2650 E63 4.0239 E36 1 


Table 7: The coefficients of the root-squaring method of Example 4.2.1. 


Download free eBooks at bookboon.com 


m = 2 


ao ay ag a3 a4 a5 
1 De as ax ee az 
—2ag —2a1a3 —2a2a4 —2a305 
2a4 2a, a5 
1 1 2 —5 —10 4 8 
1 4 25 100 16 64 
10 40 40 160 
8 32 
2 1 14 73 172 176 64 
1 196 5.329 E3 2.9584 E4 3.0976 K4 4.096 E3 
—146 —4.816 E3 —2.5696 £4 —2.2016 E4 
0.358 E3 0.1792 4 
4 1 50 865 5 680 8 960 4096 
1 2500 7.4823 ES 3.2262 E7 8.0282 E7 1.6777 E7 
—1 730 —5.6800 E5 | —1.5501 E7 —4.6531 E7 
0.1792 K5 0.0410 E7 
8 1 770 1.9815 E5 1.7171 E7 3.3751 E7 1.6777 E7 
1 5.9290 E5 3.9261 E10 2.9485 E14 1.1391 E14 | 2.8148E14 
—3.9629 E5 —2.6444 E10 | —0.1338 E14 —5.7617 E14 
0.0067 E10 0.0003 E14 
16 1 1.9661 E5 1.2885 E10 2.8150 E14 5.6296 E14 | 2.814814 
1 3.8655 E10 1.6603 E20 7.9243 E28 3.1693 E29 | 7.922828 
—2.5771 E10 —1.1069 E20 | —0.0015 E28 —1.5847 E29 
* * 
32 1 1.2885 E10 5.5340 E19 7.9228 E28 1.5846 E29 | 7.9228 E28 
1 1.6602 E20 3.0625 E39 6.2771 E57 2.5108 E58 | 6.2771 E57 
—1.1068 E20 —2.0417 E39 * —1.2554 E58 
64 1 5.9341 E19 1.0208 E39 6.2771 E57 1.2554 E58 6.2771 57 
1 3.0628 E39 1.0420 E78 | 3.9402 E115 1.5760 E116 | 3.9402 £115 
—2.0416 E39 0.6948 E78 * 0.7880 E116 
128 1 1.0212 E39 3.4720 E77 | 3.9402 E115 7.8800 E115 | 3.9402 E115 
1 1.0428 E78 1.2055 E155 | 1.5525 E231 6.2094 E231 | 1.5525 E231 
—0.6944E78 | —0.80475 E155 * —3.1050 E231 
256 1 3.4840 E77 4.0075 E154 | 1.5525 E231 3.1044 E231 | 1.5525 E231 


Table 8: The coefficients of the root-squaring method of Example 4.2.2. 
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sO £2 = x3 is a double root, x71, v2, 73 have the same modulus 2, and the two simple roots x4 and x5 
have the same modulus 1. All five roots are real. We shall see how Graeffe’s root-squaring method 
can show that three of the roots have modulus 2, and the remaining two roots have the modulus 1. 
All derived coefficients in Table 8 are positive, so we conclude that all five roots are real. When we 
compare the two lines corresponding to k = 7 and k = 8 we see that 


lis 1 4 1 > 
a1,3 ~ 3 a175 a2,3 ~ 3 a275 d4,3 ~ 3 4,75 


so three of the roots have the same modulus, and the remaining two have the same modulus, i.e. 


|r1| = |x2| = |xs3| and |v4| = |r5] - 
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ee: a i Vas, ee 

Q1,k Q2,k Q3,k @5,k 
1 2 2.2835 1.5350 1.0116 | 0.6030 
2 4 2.0394 1.6008 1.1207 | 0.8223 
3 8 2.0013 1.7467 1.0881 0.8397 
4 16 2.0000 1.8673 1.0443 | 0.9576 
5 32 2.0000 1.9325 1.0219 | 0.9786 
6 64 2.0000 1.9660 1.0109 | 0.9892 
7 128 2.0000 1.9829 1.0054 | 0.9946 
8 256 2.0000 1.9915 1.0027 | 0.9973 


Table 9: Estimates of the modules of the roots in Example 4.2.2. 


From Table 9 we derive that |x| = |x2| = |v3| = 2 and |a4| = |a5| ~ 1. In order to get 1 in the 
latter case we should proceed with the computations another one or two steps, but most people would 
already at this stage judge that the modulus is indeed 1. Finally, by inserting +1, +2 in the original 
equation we get as before 


Ly =2, Lo = #3 = —2, r4 = 1, ti =—l. © 


Example 4.2.3 Finally, we shall find the roots of the polynomial equation 


(54) 2” + 2° — 4x° — 4e4 — 243 — 5a? —- 2 —1 =0, 


by using Graeffe’s squaring method. 


We first check for possible rational roots. These can only be +1, and neither of them are roots for 
obvious reasons, because the sum of the coefficients is an odd number. 


One may also investigate if there are multiple roots. We shall not write down the tedious details, only 
mention that there are no multiple root in this case. 


Then we use the Graeffe’s root-squaring method to set up Table 10. 


It follows from Table 10 that from row k = 7, i.e. m = 2° = 128, all coefficients are uninfluenced 
by the product terms with the exception of the fifth and the seventh coefficients, which continually 
fluctuate in sign. We can therefore terminate the root-squaring process at this stage, concluding that 
there must be two pairs of complex roots and three real roots. Furthermore, the real roots must be 
of mutually different modules, because the transition from one step to the next one is approximately 
a squaring. 


It follows from r?°° = 5.6033 - 20°° that 


1 
In |ri| = 555 In (5.6033 - 10°°) =0.7713, hence |r1| = 2.1625. 


It follows from r7°° - r3°° = 1.6806 - 201°9 that 


In |r2| = = {In (1.6806 - 10'°°) — In (5.6033 - 10°?) } = 0.6609, hence |r| = 1.9365. 
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ao ay a2 a3 a4 a5 a6 az 
1 a; as aa an az aa as 
—2a2 —2a a3 —2a2a4 —2a3a5 —2a4a6 —2as5a7 
2a4 2a1a5 2a2a6 2a3a7 
—2a6 —2a1a7 
1 1 —4 —4 —2 —5 —-1|-1 
1 16 16 4 25 i 1 
+8 +8 —16 —40 —4 —10 
—4 —10 +8 8 
+2 +2 
1 9 20 —8 —26 29 -9 1 
81 400 64 676 841 81 1 
—40 144 1040 464 —468 —58 
—52 522 —360 —16 
18 —18 
1 Al 492 1644 762 357 23 1 
1681 2.4206 E5 2.7027 E6 5.8064 E5 1.2745 E5 529 1 
—984 —1.3481E5 | —0.7498 E6 —1.1738 E6 —0.3505 E5 —714 
0.0152 E5 0.0293 E6 0.0226 E6 0.0329 E5 
* * 
1 697 1.0877 E5 1.9822 K6 —5.7062 E5 9.5685 E4 —185 1 
4.8581 E5 11.8309 E9 3.9261 E12 32.5607 E10 9.1556 E9 0.3423 E5 1 
—2.1754 E5 —2.7632 E9 0.1241 E12 | —37.9334 E10 —0.2111 E9 —1.91387 E5 
—1.1412 E6 0.1334 E12 —0.0040 E10 0.0040 E9 
1 2.6827 E5 9.0666 E9 4.0534 E12 —5.3767 E10 8.9485 E9 —1.5715 E5 1 
7.1969 E10 8.2203 E19 1.6430 E25 0.2891 E21 8.0076 E19 24.6961 E9 
—1.8133 E10 | —0.2175 E19 9.7497 E20 —7.2544 E22 | —1.6899E16 | —17.8970 E9 
x * * x 
1 5.3836 E10 8.0028 E19 1.6431 E25 —6.9653 E22 8.0059 E19 6.7991 E9 1 
2.8983 E21 6.4045 E39 2.6998 E50 4.8515 E45 6.4094 E39 4.6228 E19 
—0.1601 E21 | —1.7692 E36 * —2.6309 E45 * —16.012 E19 
1 2.7383 E21 6.4027 E39 2.6998 E50 2.2206 E45 6.409439 | —1.1389 E20 1 
1 7.4983 E42 4.0995 E79 | 7.2889 E100 4.9311 E90 4.1080 E79 1.2971 E40 1 
—0.0128 E42 * * —3.4608 E90 * —1.2819 E38 
1 7.4855 E42 4.0995 E79 | 7.2889 E100 1.4702 E90 4.1080 E79 1-5213 E38 1 
5.6033 E85 | 1.6806E159 | 5.3128 E201 2.1615 E180 | 1.6876 E159 2.3144 E76 1 
* * * —5.9886 E180 * —8.2160 E79 
1 5.6033 E85 | 1.6806 E159 | 5.3128E201 | —3.8271 E180 | 1.6876E159 | —8.2137 E79 1 
Table 10: The coefficients of the root-squaring method of Example 4.2.3. 
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a 
It follows from 735° = 35 that 
a 


In |r3| = {In (5.3128 - 107°") — In (1.6806 - 10°?) } = 0.3823, hence |r3| = 1.4625. 


Since there is only one change of sign in (54) between successive coefficients, only one of the three 
roots is positive. It follows from P(2) = —39 and P(3) = 1517 that the positive root must lie between 
2 and 3. Thus we conclude that the three real roots are approximately 


ry = 2.1625, rg = —1.9365, r3 = —1.4656. 
Then we compute the modules of the complex roots. In the first case, 
5.8 512/ 1.6876 - 10159 
ey si2/ = 0.82 
m4 "Vas8 V 5.3128 - 10201 Oren 


and in the second case, 


a7.8 512 1 
mw 512/08 — = 0.4887. 
can | eee V 1.6876 191s9 — 0.4887 


Using Vieta’s equations we see that the real parts, u4 and us, satisfy 


i 
(55) us + us = 5 {-1 — (2.1625 — 1.9365 — 1.4656)} = 0.1150, 


and 
dn—1 = (—1)"~"{sum of the products of the roots taken n — 1 at a time}. 
In the given case, n = 7 and ag = —1, thus 


—-1 = rors (us + iv4) (ug — tv4) (5 + i105) (Us — tvs) 

+ryr3 (ua + tv4) (us — v4) (us + tus) (us — iv5) 

+ryr2 (us + tv4) (us — v4) (us + tus) (us — iv5) 

+ryrers {(ua — tua) (us + tvs) (Us — ts) + (ug + tg) (Us + v5) (us — tvs) } 
+ryror3 {(ua + iva) (ua — iv4) (us — 005) + (us + tv4) (ua — iva) (us + t05)} 
= (rire +rer3 +1371) (uj oP vi) (uz + vs) 

+rirer3 {2uy (uz + vs) + 2us (uj + vi)} 


2 2 2 2 
= (riret+rar3 +r3ri)rg +75 +2 (ware + usr4) rrers. 


It follows by a rearrangement that 


1 1 1 1 
) 2. ) 2. = 2 mad : 
(56) 15 U4 + 274+ Us es T4"5 =a 


We insert r1 = 2.1625, rg = —1.9365, r3 = —1.4656, r4 = 0.8260 and rs = 0.4887 into (56) to get 


0.4777 - us + 1.3646 - us = —0.1629 + 0.1200 = —0.0429, 
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so together with (55) we get the system 


| U4 oT UU = 0.1150, 


0.4774-ug + 1.3646-u5 = —0.0429, 


so using Cramer’s solution formula, 


| 0.1150 1 | 
—0.0429 1.3646 0.1998 
= = = 0.2252 
ue 1 1 0.8872 — 0°2252: 
0.4774 1.3646 
and 
1 0.1150 
0.4774 —0.0429 —0.0978 _ 


us = = 0.1102. 
es 1.3646 


1 1 | 0.8872 
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Then finally, 


v4 = 1/72 — u2 = 0.82602 — 0.2252? = 0.7947, 


and 


Us = 4/2 — u2 = V/0.48872 — 0.1102? = 0.4761, 


and the complex roots are 


0.2252 + 7 - 0.7947 and — 0.1102 +7- 0.4761. ©) 


The examples above show that the real roots are fairly easy to compute. In case of pairs of complex 
roots we first find the modulus r,,, and then write Um +ivm, where r2, = u2, +v2,. Use always Vieti’s 
formula 


—a, = sum of all roots, 


when we have at least one pair of complex roots. If we have two pairs of complex roots, then we 
should also include 


(—1)"~'a,_1 = sum of all products of the roots taken n — 1 at a time. 


If there are three pairs of complex roots, we include another one of Vieti’s formule, in which case the 
new system becomes more difficult to solve, due to the non-linearity, etc.. 


The worst case is of course when all roots of the real polynomial of degree 2n are complex, in which case 
we shall include n of Vietz’s 2n equations, of which only two are linear in the real parts ui, ..., Un. 
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5 Appendix 


5.1 The binomial formula 


We shall for completeness prove the binomial formula 
(57) (a+r = (% Jarry for a, bEC, nENo. 
j=0 


We use here the well-known notations c° := 1, and 


a n! _ 1-2---n a = 
GG sae epee Od 


ProorF. If a = 0, we get by the conventions above b” on both sides of the equality (57). 
If a £ 0, it follows by dividing by a” that it suffices to prove (57) for a = 1, i.e. we divide by a” and 
replace — by b to reduce the claim to 

a 


(58) a+ = 30 (" \o bEC and nENo. 


j=0 
We shall prove (58) by induction. 
When n = 0, we just get 1 = 1 by the conventions above. 


When n = 1, we get 


1 
(14b)'=1+6 and (5 )H= (5 Jet ( | oars. 


Therefore, (58) holds for at least n = 0 and n= 1. 


Assume that (58) holds for some n € N. Then, using this assumption, 


n 


(1+0)"t! = a+y-a+y"=a+H >> ( i )o 


EG) EG EG eG )¥ 


We notice that 


(JCS) me (28). 
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n n _ n! n! nl ro 
Gee 7 jot Golinielaat jWmaeiage eek 


—  al(nt+1) _ (n+)! aCe) 
j(n+1— 9)! jl(n41-—j) , 


Hence, finally 


voor = (3) ECS) Gt eae 


= (SE (70052 eT 


j=0 


and (58 follows by induction. 


Remark 5.1.1 It is not hard, using a similar proof, to prove Fuler’s rule of differentiation of a 
product of two C”-functions f and g, 


(9) Ue) = (5) -o 


dz”-j_ dzi’ 


h 
where we have put —~(z) := h(z). 0 
dz 


5.2 The identity theorem for convergent power series 


Theorem 5.2.1 Two complex power series 


+00 +00 
= S- One and g(z) = S- by 2”, 
n=0 n=0 


which are both convergent in the same nonempty disc |z| <r, and here are equal to each other, have 
the same coefficients, 1.e. 

On. = by for alln € No. 
ProoFr. The proof follows the same patterns as the proof of Theorem 1.1.1. Let f(z) = g(z) for all 


|z| <r. Putting z = 0 we get 


do = f (0) = g(0) = bo, 


sO 
+00 +00 +00 
f(z) -—a = an = 2 aaa” = g(z) — bo = # > bass gh 
n=1 n=0 n=0 
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for all |z| < r. Hence, for 0 < |z| <r, 


+oo +oo +oo 
y Qn412" =a, +2 y AQn422" =b, +2 y bate 2”, 
n=0 n=0 


n=0 
sO 
+00 +00 
(60) a, — b; = z S- bn4g 2” — S- An42 2” for 0 < |z| <r. 
n=0 n=0 


The left hand side of (60) is a constant, while the right hand side tends towards 0 for z — 0. Hence 
a1 = by, and it follows from (60) that 


+oo +oo 
y bn42 2” = y An42 2” for 0 < |z| <r. 
n=0 n=0 


Repeating this argument we get successively, 


ay = be, a3 = bs, ae Gn = bn, Bic aNar2g 


and the theorem follows. 
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Example 5.2.1 The power series method. Using the identity theorem we can describe the power 
series solution method of differential equations of polynomial coefficients. We shall illustrate this 
important method by the very simple example 


(61) f'(z)—2xf(z)=0, xeER, f(0)=1, 
the solution of which is of course f(x) = exp (27). 


When we insert into (61) the formal power series 
+oo +00 
f(z) = San” and fa= nae 
n=0 n=1 


where the latter is obtained by termwise differentiation of the former, we get formally 


00 +00 +oo +oo 
0 = f'(x)-2a f(x) = oe an" So » an = > (n+2Qdansea" — S- 9a,a°" 
n=1 n=0 n=—-1 n=0 
+00 
= at S- {(n+ Dance —2anha2. 
n=0 


Since this is the zero polynomial, it follows from Theorem 5.2.1, The identity theorem, that all coeffi- 
cients are zero, thus a, = 0 and 


2 


(62) (n+ 2)an42 — 2a, =0, neENo, thus An42 = ae An 


, nENo. 


From a; = 0 and (62) follow by induction that az,41; = 0 for every n € No. When the index is even, 
n = 2m, then we get from (62) that 


(2m + 2)dom+2 = 2a2m => 0, thus (m + 1)a2(m-+1) = 02.m; ME N. 


We multiply the latter formula by m! 4 0, and then we get by recursion, 


(m +1)! @om41) = M! dam = ++: = 1ae.1 = 0!a9 = ag = f(0) = 1. 


Solving with respect to dam we get 


Qm = — hence Aon = and dan41=0 for nE No. 


ml’ n! 


Therefore, the formal power series solution becomes 


— melee | 2 
f(x) = Daan a” =D) (2?) = exp (2*), 
n=0 n=0 ~ 


where we at last recognize the power series expansion of the exponential, so the formal solution is also 
the correct solution. © 
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5.3 Taylor’s formula 


We shall here prove Taylor’s formula, first in general, and then restrict ourselves to polynomials. 


Theorem 5.3.1 Taylor’s formula. Let f € C"t!(I) be an n+ 1 times continuously differentiable 
function on an open real interval, and let a € I be a given point. Then, for x € I, 


n! 


"a "a (Mg xz ¢(n+1) 
g(a) = #02) + 2 (oa) 4 FO ear AO ears PEO we _yrae 


ProoF. We shall here give a direct proof. If x is considered as a constant, then t — a is an integral 
of 1, and it follows by a series of successive partial integrations that 


fe) = f+ fr reoa=so+[F*-r0] - [Ferro 


1! 
= #a)+ 28) ea) - |S prea] + [SP wae 
"a "a —x)3 % z(¢_ w)3 
= a) + 2B) ea) + FP ea)? + |S po] - [PS rower 
"a "a (n) a Z (a — t)n 
= f(a) rr ) (x a) rr ) (x a)? fee a f | Ke ri i ( —H | Soamel (| dt, 


and the theorem is proved. 


Theorem 5.3.1 holds in particular for polynomials P(x) in the real variable « € R. We get an even 
better result, because if P(«) has degree n, then clearly P‘*1)(t) = 0, so the error term (the final 
integral) disappears, and we have 


Pa) 


P"(a) 
il (a —a)4 


(63) P(a)P(a) 4 aI 


(@ =a)? +a004 (x —a)”, for all z € R, 


without the error term. 


By the identity theorem, Theorem 1.1.1, the coefficients of a polynomial are unique, so (63) also holds, 
when zx € R is replaced by z € C. 


Finally, we leave as an exercise to the reader to prove (63), when a € C is a complex constant. 


5.4 Weierstra8’s approximation theorem 


It is the author’s experience that this very important theorem is not too well-known in general. It 
is, however, due to this theorem in many cases possible to work only with approximating polyno- 
mials instead of with more general continuous functions, so it plays indeed a very important role in 
Mathematics. 
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Theorem 5.4.1 Weierstraf’s approximation theorem. Let f be a continuous function on the bounded 
closed real interval [a,b]. Then there exists a sequence of polynomials {P,,}, which converges uniformly 
on [a,b] towards f. 


That P, — f uniformly on [a,b] for n — +oo means that to every € > 0 there exists an no € N, such 
that 

| f(x) — Pr(x)| <e for all n > no and all x € [a, b], 
i.e. the graphs of P,, lie eventually in an ¢-tube around the graph of the continuous function. 
Remark 5.4.1 We shall here give the customary proof, which for given ¢« > 0 and corresponding 


no € N explicitly defines the approximating Bernstein polynomials, so the proof of Theorem 5.4.1 is 
actually constructive. 0 


PROOF. Clearly, we can approximate the real and the imaginary parts of f separately, so without 
loss of generality we may assume that f is a real function. 
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Then notice that the linear transformation g : [0,1] — [a,b], given by 
g(t) =at+(b-a)t, te (0,1), 
makes f og a continuous function of [0,1], so we may assume from the beginning that [a,b] = [0,1]. 


Since f is continuous on the bounded closed interval [0,1], it follows from one of the main theorems 
of continuous functions that f is uniformly continuous on [0,1]. This means that given ¢ > 0 there 
exists a 6 > 0, such that 


(64) |f(x) — f(y)| < 5) whenever x, y € [0,1] and |” — y| < 6. 


Let in the following ¢ and 6 be given, such that (64) is fulfilled, and define the Bernstein polynomials 
B,,,¢(t) corresponding to the function f, in the following way, 


Bn, p(t) - 1G) ( H }#a-o", néN. 


We shall prove that 
(65) |f(t) -— Br p(t)| <e for all t € [0,1] and all n > no, 


where 


(66) no := If(@I. 


—z Max 
e€- 6? te(0,1] 


We shall need the following 


Lemma 5.4.1 
n 2 
yo {e-=} ( i ) #a-9r* = a) for t € [0,1]. 
k=0 
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PROOF OF LEMMA 5.4.1. If one is familiar with the variance of the binomial distribution from 
Probability Theory, this is trivial. If not, one proceeds in the following way: 


n 


El S} (f )ea-ort= fea te} CG )eanom 


k=0 k=0 


n—1 n—1 
— a1 \ ee ak gS M1 kee pao 
=f 12 y( ; )ea t) a a| ‘ )ea t) 


nme (n — 2)! od 
a 2. =D —e - 


2n—2 
42 942 yn-1 1 n-1 i n—2 k (4 _ 4\n—-2-k 
=t 2t* +1 +—-t-l +41 y i t“ (1-t) 
n n 


k=0 


1 1 1 
= -2¢ 4 +f bees laa, 
n nm nm 


sO 


0< ; fe-2V (8 Va ort 08 gt for t € [0,1]. 
k=0 


Returning to the proof of Weierstraf’s approximation theorem, we choose 


If@I, 


n> —> max 
— €+ 6? tE[0,1] 
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and then we get by (64) and Lemma 5.4.1 the following computation, 


) — Ba, s(O1 = no 4era-oy-or(*) (7 )ea-am 


IA 
M 
+ 
M 
ps 
wae) 
o 
mam ™~ 
15 


IA 


e 2 1 
=—+ — t)|- —t11—-t 

5+ 5 pe " (1 —t) 

Z 2 6-62 1 ¢€ ¢€ 

ee Caaermnert t é . = + =€, 
Sot Bes lOl maxeolfOl 422° 


assuming that f is not identical 0. It is of course trivial, when f = 0. 
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argument function, 54, 55 logarithm, 55 

argument principle, 58, 60 logarithmic derivative, 58 

argument variation, 56 multiple root, 13 

Banach’s fix point theorem, 68, 70 multiplicity of root, 13 

Bernstein polynomials, 83 Newton's approximation formula, 68 
binomial equation, 19 Newton's iteration method, 69, 70 
binomial formula, 6, 79 normalized polynomial, 8, 9, 41 
Cardano’s formula, 24 power series method, 81 
contraction, 68 real polynomial, 7, 39 

contraction factor, 68 reflection in a point, 9 

dAlembert’s theorem, 10 Rolle’s theorem, 15, 17 

degree of polynomial, 6 root, 10 

Descartes’s theorem, 40, 43, 47 Rouché’s theorem, 54, 61 

division algorithm, 28 Rouchés theorem for polynomials, 60 
divisor, 23 Schur’s criterion, 65 

double root, 13 similarity, 9 

elementary symmetric polynomials, 14 simple root, 13 

Euclidean algorithm, 28, 29, 32, 36, 50 Sturm chain, 51 

Euler’s rule of differentiation, 80 Sturm’s theorem, 50, 52 

fix point, 68 Taylor expansion, 8 

Fourier-Budan’s theorem, 46, 47, 54 Taylor’s formula, 82 

fundamental theorem of algebra, 10, 12, 18, 19, 51, 60-63 _ translation, 8 

Graeffe’s squaring method, 62 uniform continuity, 83 

greatest common divisor, 29 Vieti’s formule, 13 

Hurwitz polynomial, 63, 65 Weierstraf’s approximation theorem, 83 
Hurwitz’s criterion, 66 winding number, 56-58 

identity theorem, 7, 81, 83 zero, 10 

identity theorem for convergent power series, 80 zero polynomial, 7 


inversion, 9 
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